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STATEMENT OP FOCUS 



The Wisconsin Research and Development Center for 
Cognitive Learning focuses on contributing to a better 
understanding of cognitive learning by children and 
youth and to the improvement of related educational 
practices. The strategy for research and development 
is comprehensive. It includes basic research to gen- 
erate new knowledge about the conditions and processes 
of learning and about the processes of instruction, and 
the subsequent development of research-based instruc- 
tional materials, many of which are designed for use by 
teachers and others for use by students. These materials 
are tested and refined in school settings. Throughout 
these operations behavioral scientists, curriculum ex- 
perts, academic scholars, and school people interact, 
insuring that the results of Center activities are based 
soundly on knowledge of subject matter and cognitive 
learning and that they are applied to the improvement of 
educational practice. 

The Technical Section, a support activity, functions 
to identify and invent research and development strategies, 
to assist in the conduct of research and development pro- 

• • • 

111 



grams, and to train graduate students and other research 
personnel in research and development strategies. This 
Technical Report describes a contribution of the Technical 
Section staff toward the invention of research and develop- 
ment strategies. 
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ABSTRAC T 



This study tested the possibility of using the optimi- 
zation techniques of the response surface model (which 
visualizes the totality of responses to all possible com- 
binations of k independent variables as a surface or mani- 
ford in a (k+1) -dimensional space) in maximizing the re- 
sponse to a concept learning task. This experimental 
approach is unique in that a sequence of experiments are 
performed with the combinations of levels of independent 
variables comprising the later experiments being determined 
by the results of the earlier experiments. 

A secondary purpose was the investigation of the extent 
to which findings from concept learning experiments in lab- 
oratory settings would generalize to the learning of types 
of concepts typically taught in schools. The concept learn- 
ing tasks used consisted of the learning of a series of plane 
geometry concepts, quadrilateral, isosceles triangle, trape- 
zoid and rectangle, presented by series of slides and synchro 
nized tape recorded verbal cues. The response, mean level of 
performance on a recognition test given 21}. hours after the 
slide presentation, was seen as a function of these variables 
1 ) amount of redundant information, 2) mode of presentation 
of successive instances, 3) ratio of numbers of positive to 
negative instances, i}.) order of positive and negative 



instances, 5) accompanying verbal cues, 6) time slides were 
available to subjects, 7) time between successive slides, 
and 8) relative complexity of the concept. 

A sequence of sixteen experiments were run using 
fifth grade children from sixteen small town elementary | 

schools. While the sequence of experiments did not prove 
long enough to produce a maximum response, the following 
results were obtained. 

The variables of verbal cue, time per slide, and 
ratio of positive to negative instances produced the 
greatest increments in response. 

The design did move in the direction of a maximal 
response, although not as rapidly as expected. Desirable 
features of this experimental approach, such as, the 
ability to expand or contract the number of variables, 
were demonstrated. 
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CHAPTER I 



NATURE OP THE PROBLEM 

Human behavior, even in its simplest and least 
complicated psychomotor forms, tends to bo involved. 

The cognitive aspects of human behavior are even more 
complex. While recognizing that a basic purpose of 
science is to cut away the superfluous, to abstract 
and analyze into essentials and, in general, to aim 
at simple explanations of phenomena being studied, it 
is nonetheless true that researchers often attack prob- 
lems within ail overly simple theoretical and experi- 
mental fraiiioxvork solely because the tools do not exist 
for conveniently handling the problem in all its com- 
plexities. Thus, the scientific study of psychology 
and education has repeatedly resorted to simple models 
including but a few of the many admittedly interrelated 
variables which might bo determinants of the behavior 
under study. The ability to study human behavior in 
appropriately complex ways is a necessity for the in- 
creasing understanding and prediction of that behavior. 

Statisticians viorking in the physical sciences and 
in industry have been concoriied for a number of years 
with enalogous problems in attempting to optimize pro- 
duction outputs vjhlch are seen as complex functions of 
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many input variables and in estimating the nature of 
these complex functional relationships. Some of the 
more interesting and promising idoas have appeared in 
what has come to be known as response sui'faco method- 
ology. While the physical scientist has the advantages 
of working with materials that are highly homogeneous 
and subject to experimental manipulation free from ethi- 
cal restraints, and with variables that can be tightly 
controlled and accurately measured on continuous, inter- 
val or ratio scales, this differs from the situation of 
the behavioral scientist more in degree than in kind. 

Loss than twonty-five years ago Hotelling wrote; 

The possibilities of improvement of 
physical and chemical investigations 
based on the theory of statistical 
inference have scarcely begun to be 
explored. *:s- 

While today this statement is no longer true with regard 
to the fields of which Hotelling was speaking, it can 
still be applied to much of current educational and 
p s y cho logic al exp or i men t a t i on . 

As Baker (196?) has pointed out, at a time when 
adequate financing is readily available for large scale, 
sustained investigation of broad problem az*eas, educa- 
tional experimentation is still taking place lar’gely 



•jj-IIo telling, Harold. Some improvements in weighting 
and other experimental techniques. A.nn. Math. Stat., 19iili, 
1_3, 297. 
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within tho framework of a theory of experimental design 
developed to solve the difficulties of relatively small, 
stand-alone, agricultural experiments. While these tech- 
niques are still valid, they, in general, are difficult 
to apply to the complex, multifaceted type of research 
problems currently being funded. New techniques are 
needed with the capability to handle the sequential 
investigation of complex and often ill-defined problems. 
Among the capabilities required are those of integrat- 
ing knowledge from earlier sub -experiments in modify- 
ing later ones, of being able to introduce new variables 
into tho system and rejecting those which seem unimpor- 
tant, and of adapting to changing criterion measures. 

Ono promising area of investigation is the above mentioned 
response surface approach. 

Statement of the Problem 

This dissertation is the result of a simultaneous 
interest in, and concern for, two quite different prob- 
lem areas, one methodo3.ogical and tho other substantive. 
The mothodological interest is with tho topic of re- 
sponse surface methodology and the problem of applying 
this powerful toclmique to situations where the response 
of interest is some type of human behavior. The substan- 
tive concern is with the area of concept attainment or 
learning. The analogy between industrial production and 
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psychological production seemed a particularly apt one 
and led to this attempt to use the methodology developed 
in one in experimentation with the other. 

Box and Wilson (1951) published a basic theoreti- 
cal paper concerned with the optimization of responses 
in the chemical industry, upon which Box and others 
(Box, 1954> Box and Hunter, 1957) have since elaborated 
and enlarged. As they stated the problem one is inter- 
ested in a response which is supposed dependent upon a 
number of quantitative factors or variables, each capa- 
ble of exact measurement and control. The response is 
subject to error. These k independent variables plus 
the response measure define a (k+1) -dimensional space. 
Each combination of variables with the attendant ex- 
pected response determines a point in this space. The 
totality of all such points forms a surface over the 
k-dimensional subspace spanned by the independent vari- 
ables. This surface is the response surface. 

Within the entire multidimensional space one may 
identify a region, delimited by practical considerations, 
which Box terms the experimental region. The problem 
of exploring this experimental region has two aspects. 

The first is that of determining the approximate sub- 
region in which the responses are optimal. This is 
accomplished by designing a sequence of experiments 
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which permit on© to move from any arbitrary starting 
point toward points of maximum or minimum response. 

The procedure used is generally some variation of a 
steepest ascent procedure first suggested by Box and 
Wilson (1951). oaie second aspect is that of examining 
this subregion of optimal response with the purpose of 
estimating the functional relationship between the inde- 
pendent variables and the response within this neighbor- 
hood. lUiis is accomplished by saturating the subregion 
with a sufficient number of experimental points to allow 
estimation of the function by a polynomial of minimally 
sufficient degree. Knowledge of this functional relation- 
ship results in a more accurate determination of the op- 
timal combination and permits one to set production 
levels based on the relative costs of maintaining the 
optimal levels of the various factors. 

Box (1957) has carried this idea oven further, sug- 
gesting that full-scale industrial production processes 
have continuing experimentation built into them. Small 
systematic variations in the independent or input vari- 
ables can be programmed into regular production runs 
according to a specified experimental design. Those 
L variations, while within limits imposed by quality con- 

trol, are yet large enough to generate detectable ef- 
fects. These effects can then be analyzed and used to 
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provide the basis for computing new experimental combina- 
tions aimed at optimizing production. 

Educators and educational psychologists have ana- 
logous problems in attempting to maximize the amount of 
learning vjhich takes place in classroom and laboratory. 
There can be little doubt that the problem of maximiz- 
ing learning is, within certain practical limitations, 
of major concern to educators, if not the major concern. 
Obviously the correspondences to industry are not one to 
one, and any insistence on such correspondences would 
be tenuous at best. The learning process is perhaps 
far more complicated than those found in industrial 
production, functional relationships are loss well 
understood, variables are often discrete rather than 
continuous, measurement scales are frequently nominal 
or ordinal in nature, and the raw material is far less 
tractable to experimental manipulation. Nevertheless, 
both in education and psychology, new techniques for 
dealing experimentally with multifactor or multivariate 
problems are needed and the analogy betv;een industrial 
production and educational production is close enough 
to suggest that the response surface model may have value. 
Furthermore, the increasing control of the stimulus 
inputs to the learning situation vjhich are certain to 
result as the full impact of the cybernetic revolution 
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reaches education will lead to even closer analogies 
betvjoen the two tyi^es of production. The major dif- 
ficulty is in formulating approximations which will per- 
mit the use of techniques developed for continuous data 
with discrete variables measured on an ordinal or quasi - 
interval metric. 

One area of educational and psychological investi- 
gation in which a largo number of important variables 
have been identified and to which the response surface 
model has relevance is in that of concept attainment or 
concept learning. In recent years the psychological 
literature has had a plethora of experimental studies 
concerned with various aspects of how humans attain, 
acquire, develop, form, identify or lea.rn concepts. 

While some semantic difficulties exist over this vari- 
ety of verbs and over the lack of an authoritative 
definition of the word, concept, it is probably accurate 
to state that most of those studies pertain to varied 
aspects of a single, highly complex, cognitive process. 
Further, there can be no question that a basic goal of 
education and educational psychology is the maximization 
of the results of this process. 

Klausmeier at (1965) in identifying and cate- 
gorizing variables which appeared to be determinants of, 
or related to, adequate concept learning behavior. 
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compilod a list of more than one hundred variables. 

Many of those, although discrete in nature, are quanti- 
tative rather than qualitative variables and thus meet 
the basic requirement of the response surface model. 

A number of these, related in that all are aspects of 
the presentation of stimulus materials, seemed particu- 
larly appropriate for this purpose and constitute the 
independent variables studied in the empirical part of 
this paper. 

A further aspect of the problems being investigated 
stems from difficulties with the nature of much of the 
recent concept learning experimentation as it related 
to education. An examination of the scores of experi- 
mental studies devoted to various aspects of concept 
learning leads to the conclusion that psychologists 
have compiled a considerable amount of information which 
should bo relevant to the learning of the types of con- 
cepts taught in schools. However, closer examination 
raises the question of whether or not that which the 
psychologist in his experimentation has labeled a con- 
cept bears an adequate enough resemblance to the type 
of concept taught in the classroom to bo of value to 
educators. 

It appears to this writer that there are decisive 
differences betvJGon the two dofinitions of concept. A 
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major difficulty in identifying the exact points of 
difference is the lack of exact definition of what is 
meant by the word concept, either by the psychologist 
or by the educator. Further complicating the semantic 
problem is the fact that concepts are viewed as something 
to be learned, farmed, attained, identified, evolved, 
acquired or developed, and that these processes may or 
may not be synonomous, depending upon the researcher. 
Identical experimental tasks are often used as experi- 
mental analogs of psychological experiences which the 
individual researchers see as different. 

Psychological experimentation on the problems of 
how one attains a concept has relied, to a considerable 
extent, upon variations of a single typo of stimulus 
material. Reference here is made to the familiar sets 
of cards representing all possible combinations of two 
or more figures, kind of shapes, colors, textures, bor- 
ders, etc. One finds reference to the WCST (Wisconsin 
Card Sorting Task), the NYU Card Sorting Test, and many 
other unique variations which appear only once in the 
literature. Some experimenters have presented the cards 
in sequence while others have presented the entire array 
simultaneously. However, with all this surface var- 
iability all the tasks still retain basic underlying 
similarities indicating a widely accepted common idea 
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of what the experimental psychologist means by concept. 
That common idea is that a concept is an arbitrary rule 
for dichotomizing a universe of stimuli into those stimuli 
which belong and those vjhich do not. 

On the following major points these experimental 
tasks, and the definition implicit in them, provide 
an inadequate experimental sinalog for the behavior of 
the classroom learning of a concept. Each of these sets 
of stimulus materials is, first of all, restricted in 
size. A small, finite number of dimensions or at- 
tributes are all that are allowed and, typically, only 
two to four distinguishable values on each of theso 
dimensions. The result is a complete set of inst5:tuces 
consisting of all possible combinations of one value 
from each dimension which usually numbers from sixteen 
to one hundred twenty-eight unique combinations. Smaller 
sets tend to be trivial for the college populations from 
v/hich most subjects have been drawn and larger sets tend 
to be so difficult to work with that subjects cannot 
solve the problems presented in reasonable time limits. 

A second characteristic of these stimulus mater- 
ials is that the materials may be readily identified by 
the subject in words which are part of his existing 
vocabulary. For example, square, diamond, circle, red, 
blue, one, tv;o, large, small, etc., are all words which 
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are readily available to most grade school youngaters, 
not to mention the college students who represent the 
majority of subjects in concept attainment or concept 
learning experiments. 

Frequently, the subject is presented the stimulus 
material in an arbitrary, but highly systematic, order 
which emphasizes the various dimensions ,on which the 
set of materials are organized. This order also may 
operate to reduce the memory load of a subject attempt- 
ing to master the task and may even suggest a strategy 
or procedure for attack upon the task. 

The usual task set for the subject in a concept 
learning experiment is the identification of an arbi- 
trary rule which experimenter has used to classify 
the stimulus materials into sets of exemplar's and non- 
exemplars of the ’’concept.” This may be an explicit 
statement of the rule or may be inferred from the sub- 
ject’s demonstrated ability to perform as if he knew 
the rule, as in a card sorting task. Thus, the subjects 
in these experiments are doing nothing more than attempt- 
ing to infer a rule for dichotomizing the stimulus mat- 
erials into those which belong to some arbitrary subset 
and those which do not. Since the dimensionality of 
the system is finite, it is always possible to unam- 
biguously come up with the exact rule for categorizing. 
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These tasks differ from those faced by a young- 
ster trying to learn a concept in school in at least 
those three basic aspects. First, the youngster must 
learn a new word or a new use for a familiar word along 
with the concept. A vocabulary problem exists which 
is not a part of most laboratory tasks. Second, neither 
the exemplars of the concept nor the dimensions to be 
considered can be enumerated completely, let alone ob- 
served in their entirety, so that an unambiguous or 
complete attainment of a concept is impossible. Fin- 
ally, the acquisition of a new concept usually calls for 
the student to categorize his environment in, what to 
him, is some novel fashion whereas categorizing geometric 
figures as red or black, large or small, square or cir- 
cle can hardly be called novel for any population older 
than six or seven. 

Tlius, there exists the need for a more realistic 
experimental analog to concept learning behavior as 
experienced by children in schools. A task is needed 
incorporating the learning of a label along with the 
concept; a task which has built into it an indetermin- 
ateness which prevents the learning of a concept in any 
corapleto way and v;hich forces the subject to organize 
relevant aspects of his environment in somewhat novel 



ways . 
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Related to these difficulties with the typical 
concept learning task is a similar kind of problem with 
the response measures used. When a teacher says that a 
child has attained, formed or learned a new concept, he 
is indicating that the child has added something of some 
permanence to his cognitive repertoire. The response 
measure of interest to that teacher is some form of 
retention measure rather than one of immediate learning. 
Yet in the great majority of concept learning studies 
the response measure used is some concommitant of im- 
mediate learning. 

purpose of the Study 

A situation exists whore researchers in education 
have need of methodological tools capable of handling 
more complex and less well-defined formulations of 
problems. At the some time, techniques of considerable 
promise exist in other disciplines. Response surface 
methodology seems to bo one of these techniques of prom- 
ise. Hill and Hunter (1966), in a very complete survey 
of response surface literature, cite 1^.9 empirical ap- 
plications of the model. While those li-9 applications 
are primari3.y in chemistry, production problems with 
machine tools, truck tires, coastal bermuda grass, and pie 
crusts are included. Only one of the 49 references to 
applications of the model pertains to the study of human 



or animal behavior and that article (Moyer, 1963) is 
essontially a statement that the method should bo userul 
in the study of behavior rather than an application* 

Iho situation in the concept learning literature 
is comparable. Much experimental work has been done in 
this area using experimental tasks which may be highly 
inappropriate analogs of the concept learning which goes 
on in schools. Little or no empirical work exists which 
attempts to bridge the gap between the laboratory ex- 
periment and the learning activities of the classrooms. 

Given this context, this dissertation has as its 
purpose two major tasks: first, the application of a 

modified response surface type model to complex psycho- 
logical phenomena (this application will involve 
ordinal-scaled or integer-valued variables) and second, 
within the context of this response surface model, the 

investigation of a number of important independent vari- 
ables as they relate to a classroom concept leaxTiing 
task. The procedures will attempt to avoid the criti- 
cisms which have been directed toward most previous 
work in this area, specifically, the artificiality of 
tho experimental task, the finitenoss of the system, 
the lack of a vocabulary problem, and the inimediateness 
of the measure of learning. 
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Tha specific variables to be studied include: 

1) amount of redundant infoxraation 

2) mode of presentation of successive instances 

3) ratio of positive and negative instances 
k) order of positive and negative instances 

5 ) amount of information in accompanying verbal cues 

6) length of time the instances are available to 
sub j ect 

7) length of time between instances 

8) relative complexity of concept. 

Outline of the Remaining Chapters 

Chapter Two contains a brief survey of the response 
surface methodology literature, an exposition of the 
mathematical model which is the basis of response sur- 
face methodology, and a discussion of the difficulties 
in applying the model to experimental problems in edu- 
cation. 

Chapter Three is a description of the concept learn- 
ing experiment. Some findings from the literature are 
reviewed, the rationale for the study presented and the 
experiinental procedures described. 

The data collected from the set of experiments will 
be summarized in Chapter Pour. The results will be dis- 
cussed first from the experimental design viewpoint and, 
secondly, as they contribute to the concept learning area. 

Interpretation of the results will appear in Chapter 
Five along with a statement of conclusions reached and 

recommendations for future use of the findings of this 
* 

s tudy . 



CHAPTER II 



REVIEli/ OP RESPONSE SURFACE METHODOLOOY 

This chapter will briefly survey the Response Sur- 
face Methodology (RSM) literature, present in detail the 
mathoraatioal model, and discuss the various strategies 
to be used in applying the techniques to the exploration 
and estimation of a response surface. The problems en- 
countered in using RSM with discrete and ordinal scaled 
variables will be discussed and an. approximate empiri- 
cal solution to these difficulties as they apply to the 
exploration phase will be suggested. 

Survey of Respons e Sur fac e Meth odolo g y 

RSM is a comprehensive experimental strategy for 
optimizing responses, seen as functions of sets of in- 
dependent variables, and for estimating the functional 
relationship betvjeon response and variables that exists 
in the region of the optimal response. In its essence 
RSM is an integration of least squares--regression theory, 
geometric interpretation of algebraic equations, and em- 
pirical optimization strategies for use viith sequential 
exi)eririientation. While parts of these topics have been 

16 
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well understood for a long timo,*x* it was not until 1951 
that G.B.P. Box and K. B. Wilson (1951) Put them to- 
gether and coined the term, response surface. In the 
intervening years Box and others (Box and Hunter, 19575 
Box and Draper, 1959) have modified the strategy and 
techniques, primarily through the establishment of 
criteria for choosing experimental designs and the 
development of second- or higher order designs. In 
formulating practical approaches to the application of 
RSM within an ongoing industrial production system. Box 
(1957) iias coined another term, evolutionary operation, 
to indicate operational procedures leading to experi- 
mentally determined optimal responses within ongoing 
full-scale production runs. 

An early examination of the possibilities of ex- 
perimentally optimizing a response assumed to bo some 
unlmovrn function of independent variables and subject 
to error was by Hotelling (1941 )> who directed himself 
to the problem of finding the maximum response to an 
unknown function of a single variable. In so doing he 
suggested procedures ai^d raised questions which antici- 
pated the work of Box during the 1950 ‘s. Hotelling 



•X'Nevjman ( 1950 ) points out that Gauss and Legendre 
indepondontly formulated the theory of least squares dur- 
ing the first decade of the nineteenth century and ho 
credits Descartes and Format with "fathering” analytic 
geometry during the l600's. 
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suggested fitting a quadratic polynomial to observa- 
tions talcen as chosen values of the independent factor 
or variable and noted that two sources of error must be 
considered; that due to inaccuracy in taking observa- 
tions and that due to bias resulting from the fact that 
the unknown function might not be quadratic. Plackett 
and Burraan (1946) addressed themselves to the problem 
of the effect that alterations in components of a manu- 
facturing process vjould have upon some measured charac- 
teristic of the final assembly. While concentrating on 
static designs, primarily for the situation in which each 
component has only two possible values, their applica- 
tions of least squares theory and criteria for optimal 
multifactor designs definitely presage the later work of 
Box. 

The theoretical development of RSM has been largely 
the work of Box and various associates and is contained 
primarily in three basic papers. Box and Wilson (1951) 
present a thorough statement of the model, the under- 
lyirig philosophy, a criterion for choosing RSM designs, 
and examples of the use of RSM in the chemical industry. 
In the second paper, Box and Hunter (1957) introduce 
the concept of the variance function for an experimental 
design and advance reasons for preferring designs which 
are rotatable. First- and second-order designs meeting 
the criterion of notability are given. The third paper. 
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by Box and Draper (19?9 ) introduces the bias criterion 
for design selection. The content of these three papers 
will be discussed more fully in the section on the RSM 
model. 

In addition to these three papers several good, 
general statements of RSM have appeared. Box (1954) 
presents a thorough discussion of the principles under- 
lying the method with only limited explanation of cal- 
culation procedures. A full explanation of the model, 
methods, and calculation procedures, as well as a state- 
ment of the basic principles and strategy is in Chapter 
11 of The Des i gn and Analysis of Industrial Ex periments 
edited by Davies (195^) co- authored by Box. Box 

and Hunter (195^) present another thorough presentation 
in a chapter in Experimental Designs in Industr y edited 
by Chow. A somewhat less detailed, but very readable, 
review of RSM appears in a series of articles by Bradley 
(1958) and Hunter (1956, 1959a, 1959b) directed toward 
the industrial quality control worker. 

The remainder of the theoretical literature is con- 
centrated upon the construction of specific designs and 
the exaniinations of the statistical properties of those 
designs. V/ith the exception of an article by Spondley, 
Hext, and Himsworth (1962) suggesting the usefulness of 
simplex designs to evolutionary operation procedures. 
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these thoorotioal papers tend to have little applica- 
bility to the problems considered in this paper, largely 
because of the restrictive mathematical properties re- 
quired by the models, A very complete literature sur- 
vey of RSM written by Hill and Hunter (1966) lists these 
peripheral theoretical papers, as well as an extensive 
enumeration of the variety of practical applications 
which have been found for RSM. As indicated in Chapter 
One, these applications are to a wide variety of in- 
dustrial and agricultural uses, including the manufacture 
of truck tires (V/eissert and Cundiff, 1963), growth of 
coastal bermuda grass (Welch, Adams and Garmon, 1963), 
maximization of potato yields (Hermanson, 1965), pro- 
duction of machine tools (Wu, 196ij.), and improvement of 
pie crusts (Smith and Rose, 1963). The only reference 
to RSM to the behavioral sciences that this author has 
found is the dissertation work of Meyer (1961, 1963) 
which is theoretical rather than applied, 

Moyer (1961) investigated the use of RSM with 
integer-valued factors but the emphasis vjas upon the 
estimation of the shape of that surface in some static 
region rather than upon the exploration in search of an 
optimal region. By Monte Carlo procedures he examined 
a long series of designs with intogor-valuod factors 
and compared them to corresponding optimal designs 
with continuous -valued factors. 



In summary, the thoorotical model for use with 
continuous variables is well developed and applications 
to a wide variety of industrial situations have been 
reported. However, applications of the RSM model to 
empirical problems in education and psychology seem to 
be nonexistent. V/hile this condition is due in large 
part to the restrictions which the mathematical model 
places upon the nature of the independent variables 
and the metric in which they are measured, it is probably 
also true that lack of familiarity with the technique 
on the part of behavioral researchers has contributed to 
this void of behavioral applications. 

The Response Surface Model 

The general problem for which RSM was developed is 
that of empirically investigating a workable production 
system for the purpose of identifying, if possible, com- 
binations of the input variables to the system vjhich 
would optimize some measure of the yield or output of 
the system. In the discussion that follovjs the inputs 
will be referred to as variables and the outputs as 
responses. The responses may be measures of qu£intity 
or quality of output, in which case the interest is in 
identifying combinations producing a maximum response, 
or it may be a measure, such as cost per unit produced, 
in which case the optimal response vjould bo a minimal one. 
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That the system is a workable one implies that a com- 
bination of relevant variables has been identified 
which results in a minimally acceptable response. 

Whether the response is one, such as amount of chemi- 
cal produced seen as a function of time, temperature 
and per cent solution of catylyst, or grain yield per 
acre as a function of amount of fertilizer and irrigation, 
or amount of learning as a function of number of exam- 
ples and time spent on each, it is highly unlikely that 
the original combination which results in an acceptable 
product will be close to the optimal combination. Thus, 
the first aspect of RSM is the development of experi- 
mental procedures for efficiently moving from some ori- 
ginal v/orkable combination of variables to a combination 
which will, in some sense, result in an optimal response. 

Once one has identified what appears to be an op- 
timal response on one criterion, it is reasonable to 
investigate the possibility of multiple optimal combina- 
tions and, should they exist, attempt to identify among 
them that combination which is optimal on some second 
criterion. For example, if a number of combinations of 
levels of variables result in equal maximal quality of 
product, one might inquire as to which of these mini- 
mizes the cost per unit produced. This leads to the 
second aspect of RSM, that of identifying experimental 
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designs which will enable one to efficiently estimate 
the nature of the relationship that exists between the 
variables and the response when the levels of the 
variables are in the optimal range. Thus, RSM has 
two aspects, the first exploratory and the second aimed 
at the estimation of the functional relationship. 

Box and Wilson-ic- (1951) originally stated the prob- 
lem as follows: One is interested in a response,)^, 

which is supposed dependent upon the levels of a set of 
k quantitative factors or variables, Xi> • • • >Xj^, • • * ,Xic. 
These variables are subject to exact measurement and 
control. Thus, for the u^^ combination of the levels 
of the variables (u=l, 2, '* * * ,n ) , 

Hu = f (Xiu>X2u» * * * >^ku) • 

This unknown function, f, will bo referred to as the 
response function. The observed response to any combina 
tion of levels of the variables, y^, is subject to ex- 
perimental error and, in repeated observations, varies 

p 

with mean and variance cr^. 

Viewed geometrically, the response and the k vari- 
ables define a (k+1 ) -dimensional Euclidean space. The 
response to any given combination of levels of the 



vcThis description, of the model, experimental strategy, 
and criteria for choosing RSM designs is based upon the 
work of G’.E.P. Box and his associates, particularly the 
basic articles mentioned on page 17 > Box and Wi3.son (1953. )> 
Box and Huxitor (1957 )> anh Box and Draper (1959 )• 



variables is a point in that space and the totality of 
points corresponding to tho responses to all possible 
combinations of tho variables form a surface over tho 
k-dimonsional hyporspaco defined by the variables, ^his 
surface is tho response surface (RS). 

Circumstances which will influence the strategy of 
the investigator in exploring this RS are the magnitude 
of tho experimental error, tho complexity of tho response 
function (or surface), and whether or not tho experiments 
may bo run sequentially with each experiment designed 
using knowledge from preceding ones. 

One approach to identifying optimal conditions 
would be to examine the entire domain of the k variables. 
This can. always bo accompli shod by saturating tho domain 
with a sufficient grid of points to allov; adequate ap- 
proximation of the response function, f. This, in gen- 
eral, v;ould require an impracticably large number of 
points. However, in those situations whore tho experi- 
mental error is small and the experiments can be run 
sequentially, there are strategies which will locate 
optimal combinations with greatly reduced experimental 
effort. 

Small experimental error implies that small changes 
in>|can be accurately detected and small sub-regions of 
the RS can be investigated with a small number of 
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exporimontal points. Whan exporimonts can bo run se- 
quontially, knowledge from one sot of experiments con 
bo used to identify a new sot of conditions resulting 
in a higher (or lower) level of response. By a series 
of such moves ono should movo from any arbitrary be- 
ginning set of exporimental conditions to a set which 
results in uniformly high (or low) responses, i.e., to 
a region of optimal response. V/hile this region may 
only provide a local rather than an ultimate optimum, 
this is an inevitable risk of limited exploration of 
the variable space. 

Tftfhilo the entire surface (or its algebraic equiva- 
lent, the function f) may be quite complex, knowledge of 
many phenomena suggest that a reasonable assumption is 
that it is both smooth and continuous. Thus, the func- 
tion may be approximated-::- to any dosirod degree of ac- 
curacy by a polynomial of sufficiently high degree. If 
either k, the number of variables, or d, the required degree 
of the polynomial, is largo a polynomial of the form 
p = Sq + aiXi +a2X2 + ’ ‘ * 

^ * * * "*'^kk^k^ ’*'^12^1^2 * * * 

+^lll^l^+a222^2^'^ • • • +ajji^kXk^+ai3^2Xl^^2+ * * * 

-x-The Wei ers trass approximation theorem states: If 

f is continuous on the closed interval ra,b] , thon it can 
bo unifoi'mly approximated to within « on fa,bl by a poly- 
nomial, for anye>o. (Buck, 1956; p. 39) 
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can require the estimation of a considerable number of 
constants.*'^ The number of constants to be estimated for 
k=l,2,*’*,5 and d=l,2,***,5 are given in Table 1. Since 
the minimum number of points needed to fit any polynomial 
is the number of constants, the figures in Table 1 pre- 
sent the minimal numbers of experiments needed to fit 
the approximating polynomial for given d and k. Add to 



TABLE 1 

Numbers of constants to be estimated in 
fitting a polynomial of degree, d, in k variables. 



Number of 
Variables (k) 


1 


Degree of polynomial (d) 

2 3 k 5 


1 


2 


3 


Al- 


5 


6 


2 


3 


6 


io 


15 


21 


3 


k 


10 


20 


35 


56 


k 


5 


15 


35 


70 


126 


5 


6 


21 


56 


126 


252 



this minimal number the experiments needed to adequately 



sample the entire domain of the variables and the ropli- 

2 

cations necessary to estimateCT and the experimental ef- 
fort needed to adequately approximate f over the entire 
domain of the Xj is, in general, impossible in practice. 

Fortunately, in most applied situations practical 
restrictions on the variables limit the domain which is 



■''''The notvation used in writing polynomials subscripts 
each constant with a sot of indices consisting of all the 
indices of the corresponding variable or product of vari- 
ables. Thus, the coefficient of X2^^3=X2 * X2 * ^223* 
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of initial interest and the region of ultimate interest, 
that over which one would like to estimate f, is even 
further restricted to ai'eas of optimal or, at least, 
near-stationary response. The result of these restric- 
tions on the domain of interest is that polynomials of 
degree 1 or 2 usually provide adequate, local approxi- 
mation to f bringing the needed expenditure of experi- 
montal effort within reasonable practical levels. 

One begins the investigation of a RS in some arbi- 
trary sub -region of the variable space. It is unlikely 
that responses in that region are either minimal or maxi 
mal. If the region is not one in which the response is 
near optimal, then one is in a region where the RS has 
a slope relative to the hyperplane defined by the vari- 
ables. The farther one is from either a maximum or 
minimum, the greater that slope is likely to bo. 

The first phase of the RS problem is that of ef- 
ficiently moving from this arbitrary initial region 
of investigation to an optimal one. Box suggests that 
the move should be that which results in the greatest 
increment in the response, i.e., perpendicular to the 
contour lines of the RS or in the direction of steepest 
ascent. Since the only information desired from the 
initial set of experiments is that of which direction 
to move next, and, since the slope of the RS is likely 
to be relo.tively steep, the local apxJroxiraation of the 
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RS by a plan© should ba sufficient. As one, through 
a sequence of moves, gets closer to the optimal region 
the slope will decrease and a quadratic or cubic sur- 
face may bo needed. 

The local approximation of the RS by a plane is 

the equivalent of estimating f by the first degree 

polynomial, Pi=ao+aiXi + * * * +aj_Xi+* * * +ai^Xk* 

The direction of steepest ascent can be found by cal- 

3Pl 

culating the first order partial derivatives, of 

pi- The direction of steepest ascent will be that 
taken when increments of the Xj|^ are proportional to 
these partial derivatives. A check on the adequacy of 
the calculations is provided by the extent to which the 
indicated increments of X^ do lead to increments in the 
response. 

Movement is made to a sub-region up the slope of 
the RS from the initial region, a new sot of experi- 
ments run, and the slopes redetermined. By a series 
of such steps one moves to regions of higher and higher 
response. However, the higher up the RS on© moves, the 
more gradual the slope of a plane becomes and the more 
likely on© is to move as a result of experimental error 
rather than a true increment in response. Also, the 
plane becomes a less and less satisfactory approximation 
to the RS. On© is in a near-stationary region. The 
second phase of RSM is the exploration of this region. 
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Typical Reaponso Surfaces in regions 
of near-stationary response. 
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This second phase of the investigation attempts to 
discover whether the near-stationary region is a) a 
true maximum (as in Figure l,a), b) a ridge system, 
either stationary or slowly rising (as in Figure l,b 
and c) or c) a saddle point or col or minimax (as in 
Figure l,d). If the region is not too large, a second- 
degree polynomial should provide an adequate approxima- 
tion. 

The designs suggested by Box for this first phase 
exploration are the two level factorial designs and, 
in situations where it may bo assumed that factors will 
not interact, the two level fractional factorials. The 
center of each design is chosen as the origin and the 
units of each variable so chosen that the levels of each 
are +1 and -1. 

The choice of appropriate units for the several 
variables needs comment as the slopes of the fitted plane 
and resulting path of steepest ascent are clearly not 
invariant under a change of measurement scale. The 
procedure which makes most sense for choosing appropriate 
units for the variables is that of choosing units so that 
unit increments in any single variable lead to approxi- 
mately similar increments in the response. Success in 
these choices are obviously a function of the skill and 
experience of the experimenter. Adept choice of scale 




results in an approximately symmetrical representation 
of the response function which reduces the amount of 
calculations to be done in the estimation phase of the 
problem. Because of the sequential nature of the pro- 
cedures, an inappropriate choice of scale unit may be 
detected and corrected in later experiments. 

When any one variable produces effects that are 
quite small compared to those of ofther variables, it 
may be that: 

a) the unit chosen is inappropriately small, 

b) the response is independent of that variable, or 

c) one is at a conditional maximum for that variable. 
By increasing the size of the unit and changing the level 
of that variable from the calculated path, one may de- 
termine the exact situation. 

The approximation of the RS in the near-stationary 
region requires a polynomial of, at least, degree two. 

An experimental design must be chosen v;hich will both 
provide efficient estimates of the constants in the poly- 
nomial and facilitate the recognition of the character- 
istics of the RS in that region. Given an appropriate 
design, a second-degree polynomial can be fitted to the 
RS. A check on the adequacy of the fit can be made by 
partitioning the sum of squares into that due to the 
approximating polynomial (due to regression) and that 
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which represents lack of fit (due to deviations from 
regression). An analysis of variance may bo done and 
the mean square for lack of fit examined. If the second- 
degree polynomial provides an adequate approximation to 
the RS, this mean square for lack of fit is an estimato 
of the variance due to experimental error. If the 

fit is not adequate then the mean square should be in- 
flated by amounts due to the extra constants and should 
bo considerably larger thancr^. Thus, if an independent 
estimate of is available, a comparison of this esti- 
mate with the mean square provides a test of the ade- 
quacy of the second degree polynomial in approximating 

the RS. 

Given that an adequate approximating polynomial 
has been calculated, one still has the problem of recog- 
nizing the true nature of the surface from the study of 
the equation. As the polynomial will be oriented about an 
arbitrary origin, it will, in general, bo difficult to 
learn much about the surface from direct examination to 
the constants. Box suggests that the polynomial be trans- 
formed to canonical form, i.e., to a form oriented about 
an origin at the estimated point of optimal response with 
axes along or perpendicular to the major axis of the RS. 

The canonical form for a second degree polynomial 

is of the form 

Y " "^m == ^11^1^ A22^2^ ^klc^k^ 
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where Y is the response, Y^ the response at the esti- 
mated point of maximal response, and the the set of 
axes corresponding to the axes of the RS. 

The estimated point of optimal response is found 
by taking the partial derivitivos of the approximating 
polynomial relative to the k variables, setting 

each equal to zero and solving for the coordinates of 
the point. The may bo found by solving the character- 
istic equation 



ail '^^12 Wk 




All 0 ••••0 


W 2 *^22 ••• Wk 




0 A 22 ‘ ‘ 0 


• • • • • • 




• • • • • • 


• • • • • # 

^^Ik ^2k ' ■ ' “kk 




• • • • • • 

0 0 ••• Aj,k 



for the A^j^. The transformation from the original X^ 
system to the x^ one oriented to the RS is accomplished 
by solving the matrix equation 

kxk kxl kxl 



[’] h -^ 1 .] ■ [“] [■'1 

subject to the restriction that j_ T J I Tj = [ I | where 
l| is the k k identity matrix. 

In the case where there v;ere on3.y tvjo variables the 
second degree canonical form vrould be 



^ ^m ~ -^11^1^ ^ 22 ^ 2 ^* 

Examining the possible shapes of the surfaces pictured 
in Figure 1, the advantages of the canonical form can 
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be easily seen. When the contour lines of the surface 
are ellipses (as in Figure 1, a) then both All and A22 
will be negative with the relative absolute magnitude 
of the constants indicating the degree of elongation. 

The presence of a ridge system (as in Figure 1, b) would 
be indicated by one of constants being approximately 
zero. Should the ridge be of the slowly rising type, 

V 

indicating a maximum at infinity (as in Figure l,c), 
the origin could be taken at some convenient point on the 
axis of the ridge, in which case the canonical form 
would be 

Y - Yjyj = 4- 

with being the slope along the x^ axis. When the 
signs of the constants, A^^^ and A22» opposite one 

has a RS in the shape of a saddle point or minimax (as 
in Figure l,d). 

To this point the need to approximate the RS by 
a polynomial has been indicated several times, but 
nothing has been said of the procedure for calculating 
the best polynomial. The actual procedure used is that 
of fitting the polynomial to the RS by the procedures 
of linear least squares or curvi -linear regression. 

That is, the constants, aj^, in the polynomial are chosen 
so as to minimize the sum of squares of the deviations 
between the responses predicted by the equation and 
those actually , observed as a result of experiments. 



Tho basic assuraption ol‘‘ least squares is that the 
observed response is a linear function of a sot of 
latent parcmeters or constants. It should be noted 
that this permits the estimation of polynomials since 
the linear restriction is on the constants, not the 
variables. Further, while the variables are assumed 
error-free this does not seem to be a particularly 

j 

serious source of difficulty when the variables are 
levels or poi^ers of levels of factors which are set 
prior to the running of experiments as they are here. 

In the previous pages the overall strategy and 
general procedures for a response surface exploration 
and estimation problem have been outlined. The next 
problem to be considered, and that which has monopolized 
the efforts of theoreticians in this area is the experi- 
mental design problem of carrying out these procedures 
in the most efficient manner. That is, what strategies 
can be devised for selecting experimental combinations 
of levels of the variables that will provide the great- 
est amount of information, i.e., the best fitting 
polynomial, for a given amount of experimental effort. 
The three major contributions to this problem have been 
the original Box and Wilson (1951) paper which intro- 
duced central composite designs. Box and Hunter’s (1957) 
suggestion that desirable designs have the property of 
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rotatability and the Box and Draper (1959) paper ©m- 
pliasizinG importance of the criterion of minimal 

bias * 

Box and Wilson (1951) point out that, while the 
two level factorial and fractional factorial designs are 
adequate for estimating plane surfaces, the efficient 
estimation of qu Iratic and cubic surfaces requires a 
different configuration of points. The composite de- 
signs suggested are basically factorial designs which 
may bo augmented by additional points at the center and 
on the axes of the design should a first degree poly- 
nomial be deemed an inadequate fit. 

Box and Hunter (1957) introduced the concept of 
the "variance function," V(x), of an experimental de- 
sign, which they define as 
V(x) = N V(7x)/<^ 

where V(y^) is the variance of the estimated response 
at any given point, Tiie variance function proviues a 
standardized measure of the precision with which bhe 
design estimates the surface. If the value of V(x) 
at any given point is strictly a function of the distance 
of the point from the center of the design thon the de- 
sign is said to bo rotatable, i.e,, rotation of the axes, 
as in the transformation to canonical' form, will have 
no effect on the precision with vjhioh the design esti- 
mates the surface. In the situation where one does not 



know tho orientation of tho RS to the hyperplane of the 
variablo3 prior to the running of the experiment there 
are obvious advantages to a design meeting this criterion. 

However, Box and Draper (1959) point out that in 
approximating any unknown function by fitting a poly- 
noraial to empirical data one not only runs the risk of 
error duo to sampling, but, also, due to possible in- 
adequacies in the polynomial as a representation of the 
true function. This latter source of error is referred 
to as bias error, in contrast to tho former, variance 
error. In investigating the problem of designs to mini- 
mize both sources of error, they studied the situation 
where a first degree polynomial was used to approximate 
a function that was truly a second degree polynomial. 

They arrived at the conclusion that, when both variance 
and bias error are present, the optimal designs for mini- 
mizing both sources of error are very similar to those 
which ignore variance error completely and only minimize 
bias error. This is particularly important since the two 
criterion, minimize variance error and minimize bias 
error, lead to different design principles. Minimizing 
variance error leads, in general, to large designs with 
points outside the region of immediate interest while 
the bias criterion calls for designs with experimental 
points within tho region of immediate interest. 
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The additional theoretical papers vhioh have ap- 

♦ 

pearod since the Box and Draper paper have attempted to 
apply these ideas to more complex situations where the 
approximating polynomial and the true function are of 
higher degree, but no really new approaches or strategies 
have been proposed. 

An interesting extension of RSM has been to the 
problem of maintaining an ongoing production process at 
near-optimal conditions in spite of the effects of changes 
in the quality of input materials, systematic changes in 
instrumentation, and differences between the conditions 
of a full scale production run and those of a limited, 
more tightly controlled laboratory routine. Since all 
production processes have tolerance limits within which 
the quality of product is acceptable, it is possible to 
introduce small variations in the production conditions 
leading to information about the continued optimal char- 
acteristics of those production conditions while still 
maintaining acceptable product output. Box ( 1957 ) intro- 
duced this idea under the name, evolutionary operation 
(EVOP). EVOP is, in essence, the continual application 
of the optimization or exploration phase of RSM to an 
ongoing production process. The basic idea underlying 
EVOP is that the operation of any industrial process 
for the sole purpose of producing a product is inefficient; 
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the pi'ooess should be made to produce both product and 
information as to how the product might be improved. 

Since one is only concerned with the levels of variables 
which produce optimal responses the local approxima- 
tion of the RS by a plane is adequate and the design for 
the variations in the ’’works process” can be quite sim- 
ple. Box (1957)i in explaining BVOP, uses a 2 -lev el 
factorial. 

Another design which has many desirable properties 
for investigating a RS when a plane is deemed a suf- 
ficient approximation is a simplex design. While Box 
and Hunter (1957) only allude to the desirable character- 
istics of simplex designs, a full description of the 
application of the design to optimization and EVOP prob- 
lems is presented by Spendley, Hext, and Himsworth (1962). 
Since it is the use of a simplex design which is fol- 
lowed in the sequence of concept learning experiments 
to be described in Chapter Three, the usefulness of the 
design, as presented by Spendley et al., will be des- 
cribed at length. 

A simplex is that minimal geometric shape which 
can exist in a Euclidean space of given dimension but 
not in a space of lesser dimension. Thus, a line seg- 
ment is a simplex in one dimension, a triangle a sim- 
plex in two dimensions and a tetrahedron a simplex in 



throo dimensions with the concept generalizable to k- 
dimensional space. A regular simplex is one in which 
the distances between vertices are equidistant. In an 
experimental situation with k variables, a regular 
simplex design would consist of (k+1) experimental com- 
binations representing the vertices of a regular simplex. 

The basic simplex design with k variables is given by 
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where P = | (k-1 )+V^ ^ and q = ^ |Vk+i 




Each of the (k+1) rows of this matrix are the coordin- 
ates of a vertex of the simplex. Regularity is not seen 
as a problem when the variables are measureable on con- 
tinuous, interval scales, because regularity can easily 
be obtained by an appropriate linear transformation of 
any one or more of the variables. 

At this point certain characteristics of a sim- 
plex should be noted. A simplex of any given dimension, 
say k, has (k+l) vertices and (k+1) faces. Each face of 
a k-dimensional simplex is a simplex of (k-1) dimensions. 
The removal of any one point or vertex from the design, 
thus, collapses the design into a simplex of one less 
dimension. Correspondingly, the addition of a new point 






to the design (a point not on the hyperpiano defined by 
the existing simplex) creates a new simplex of one 
larger dimension. 

Since the exploration of a RS is a series of moves 
of an experimental design from one region of the vari- 
able space to another, the characteristic of a simplex 
design which permits one to create a new simplex by 
removal of only one vertex and the subsequent substi- 
tution of any new point not on the hyperplane defined 
by the remaining points is a particularly useful one. 

Any exploration procedure consisting of a set of 
moves requires a set of rules for making those moves. 
Rules are needed for decisions specifically concerned 
with: when to move, in what direction and how far to 

move, and how to calculate the new combination or com- 
binations. The rules given by Spendley £t al^. are 
particularly simple. 

The decision rule concerning when to move, i.e., 
when to change the levels of the independent variables, 
is that one moves or changes combinations as often as 
possible. Thus, one runs experiments at the first (k+1) 
combinations and from then on each additional experi- 
ment is a new combination, the exact nature of which 
is determined by the results from the immediately pre- 
vious (k+1) experiments. 



Two logical alternatives in a rule for deciding in 
which direction to move would be either to move away from 
the point of least optimal response or to move toward 
the point of most optimal response. Note that both of 
these procedures divide the original set of (k+1) points 
into two subsets of more or less optimal responses, one 
of 1 point and one of k points. The first alternative 
defines the least optimal subset as having a single point 
while the second defines this subset as having k points. 
Since the purpose is to find a region of optimal re- 
sponse, the changes made should be those of replacing 
the least optimal set with a new set of points which lie 
on the opposite side of the most optimal set from those 
being rejected. Thus the implication of the first; alter- 
native is the replacement of one point by another at each 
step of the sequence while the second implies that a set 
of k least optimal points be replaced by a new set of 
k points at each step. In either case the new config- 
uration of points should approximate the mirror image 
of the original configuration. 

The second alternative clearly results in the 
greatest change in position of the simplex design but 
also requires the greatest amount of experimental effort, 
in the two variables case, twice os much. Furthermore, 
since .the responses at all experimental points are 



equally subject to error, the second rule clearly in- 
volves the most risk as it involves using the informa- 
tion from (k+1) responses to make k changes while the 
first approach is using the information from the (k+1) 
responses to make a single change. On the basis that 
the risk of incorrectly responding to responses subject 
to error should be minimized, the first alternative is 
followed, i.e., reject the single least optimal point 
and choose a new point opposite the face of the sim- 
plex defined by the remaining k points. By keeping 
the simplexes of equal size, one also controls the 
distance moved. 

The third decision is that of determining the levels 
of the variables for the now combination. That point 
which will result in a new simplex on the opposite side 
of the face defined by the k more optimal points is 
found by taking, as any coordinate of the new point, 
twice the average of the corresponding coordinates of 
the remaining points minus the corresponding coordinate 
of the discarded point. 

Several advantages of the use of simplex designs 
with these decision rules should be noted. 

1) The arithmetic involved, both in the calcu- 
lation of nev/ design points and in the evalu- 
ation of the response measures, is relatively 
simple compared to the steepest ascent methods 
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suggested by Box and Wilson (19^1)* 

2) The necessary decision rules can be simply 
stated and easily followed. Neither hypo- 
thesis testing nor interval estimation pro- 
cedures are used, the idea being that a de- 
cision inappropriately made duo to errors 
in the responses will be corrected in sub- 
sequent steps of the procedure. 

3) The direction of advance of the design from 
the original set of points to a set in the 
region of optimal response is dependent solely 
on the rank order of the responses and not 

on their value on any absolute scale. Thus 
the response data may be judgments or rankings 
rather than measurements in the interval or 
ratio scale sense of the term. 

l\.) One need depend little on any assumption of 
planarity of the response surface except in 
the immediate region of any single simplex, 
an assumption which is quite tenable given 
appropriate choice of the levels of the 
various factors. 

5) The dimensionality of the variable space can 
readily be expanded or contracted. A new 
factor or variable, previously held constant, 
can be added to the design by the addition, of 
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SL singlo now point. Tiio contraction of 
a design, by removal of a variable which 
appears to be having little effect on the 
response, is somewhat more complicated be- 
cause it could result in loss of regularity. 

As in any system where experimental error is pres- 
ent, one will occasionally make a false move, i.e., a 
move in an inappropriate direction, because the treat- 
ment effects are obscured by the error. This is not 
particularly serious, however, as the procedure calls 
for continuous review of past decisions. Errors at 
one step of the sequence can be expected to be caught at 
a later stage and the greater the adverse effect of the 
inappropriate move, the more rapidly it will be detected. 

The size of the error standard deviation can be 
reduced by replication of the experiments at each com- 
bination but Spendley et a^. demonstrates that it is 
more efficient not to expend effort in replication but 
to instead run a longer sequence of experiments. He 
also presents empirical data that suggests that the 
procedure works better with larger numbers of variables 
than with smaller. 

Applying RSM ^ Educational Research 
The major premise upon which scientific, meaning 
empirical and inductive, investigation is based is that 
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of tho essential lawfulness of nature. The outputs 
occurring in any natural system or subsystem of interest 
must be viewed as functionally dependent upon both the 
initial state of tho system and the kinds and amounts 
of relevant inputs to the system. Thus, in the behavioral 
sciences, as well as in the natural sciences, the ulti- 
mate outcome of research activity should be the specifi- 
cation of tho relationships which exist between status 
suid input variables on the one hand and outputs on the 
other. However, when the variables are categorical rather 
them quantitative, ordinal rather than interval, or dis- 
crete rather than continuous, as they often are in the 
behavioral sciences, difficulties vdth the mathematics 
needed to construct an appropriate model have often 
forced researchers to adopt experimental or research 
strategies which obscure this ultimate goal of the spec- 
ification of functional relationships. Even in areas 
whore tho existence of relationships among sets of vari- 
ables are well established and the nature of those re- 
lationships indicated, researchers in education still 
revert to procedures which can only answer the prelim- 
inary question concerning what variables are related 
or the comparative question inquiring as to which of 
two, possibly mediocre, choices is the better. In 
contrast, RSM begins witn the assumption that the gross 
nature of the relationships among tho variables in a 
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system is known and aims at answers to the questions of 
which is the best combination of variables and what is 
the precise nature of the relationships that exist. 

It is the contention of this writer that there are 

r 

areas of education where the gross nature of relation- 
ships among variables are v/ell enough understood to 
warrant an attempt to fit this knowledge into the RSM 
model that has! been described and attempt both the op- 
timization of responses and the approximation of rela- 
tionships existing between variables and response in 
the region of optimal response. While the second task 
is, perhaps, the more difficult, the first must take 
precedence and the empirical aspects of this paper will 
attempt to demonstrate that this can be done. 

The difficulties in applying RSM to the problems 
of education are several. Probably the most apparent 
difficulty is with the metric characteristics of the 
variables identified as relevant. All too often these 
are categorical rather than quantitative. Since cate- 
gorical implies that there is no rationale for ordering 
the levels of the variable, there can be no RS in the 
sense discussed here. However, researchers in education 
have too willingly accepted identified categorical vari- 
ables as important for study and have, perhaps, spent too 
littlq effort in attempting to analyze categorical vari- 
ables into possible underlying quantitative variables. 
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Another metric problem is that of measuring those 
variables which are quantitative on scales which have 
the relevant metric propertios--those of being continu- 
ous and equal interval. Neither is an insurmountable 
obstacle if one is willing to tolerate a certain amount 
of imprecision and inefficiency in the design. The use 
of a scale which is ordinal but not sufficiently equal 
interval will cause the response function to be un- 
necessarily complex but should not seriously inter- 
fere with the optimization process. When the variable 
is discrete, the RS breaks down and becomes a lattice or 
net of points rather than a surface. While it may make 
good experimental sense to operate as though this lat- 
tice or net of points would all lie on some smooth sur- 
face, one should realize' that the mathematics upon which 
RSM is built requires that the totality of responses 
form a surface and that the violation of this requirement 
could lead to peculiar results. 

The changing nature of the experimental material is 
yet another difficulty. Most of the responses of inter- 
est to educators are some concomitant of physical or 
cognitive growth and any procedure v/hich runs experiments 
consecutively rather than simultaneously runs the risk 
of confounding treatment effects with those of normal 
growth or development. 
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BdCAUso RSM dicba.tos that axporiiuants bo run so- 
quentially, one must be especially careful that the run- 
ning of an earlier experiment doesn’t contaminate the 
experimental material to be used later. Combined with 
the possibility that a long sequence of experiments may 

be run, one needs access to a relatively large supply of 
subjects . 

It should be noted that the difficulties with the 
mathematics due to the grossness of measurement techniques 
is far more serious for the problem of estimation of re- 
lationship in the area of optimal response than it is for 
the exploration phase leading to that optimal response. 

Summar y 

In surveying the RSM literature, the pre-eminent 
role of G.E.P. Box in its theoretical development was 
stressed. An indication of the breadth of practical 
applications of the methodology in industry was contrasted 
to the void of application in behavioral research. 

The mathematical model has been presented in some 
detail, both the exploration and estimation phases. An 
approach to the exploration phase due to Spendley et al. 
and utilizing certain very desirable properties of sim- 
plex designs has been outlined. 

Difficulties In applying RSM to behavioral, spec- 
ifically educational, research have been discussed and 
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the point made that the difficulties are far more ser- 



ious with the estimation phase of the procedure than 
with the exploration. 







CHAPTER III 



AN APPLICATION TO CONCEPT LEARNING 

As indicated in Chapter I a major purpose of this 
dissertation was to demonstrate that the experimental 
procedures and strategies presented in Chapter II could 
bo usefully applied to areas of behavioral science of 
specific interest to education. This chapter is a 
description of such an appliration. A brief introduc- 
tion to the topic of concept learning and tho rationale 
for using it as tho vehicle for testing tho optimiza- 
tion ideas of RSM will be presented first. This v/ill 
be followed by a review of tho procedures follovred in 
running tho sequence of experiments including those 
dealing with development of the experimental task, the 
response measure, the population from which subjects wox*e 
drawn and tho steps follovjed in the actual performance 
of tho experiment. 

As indicated at the conclusion of Chapter II, Ijhe 
basic requirement for the conceptualization of a re- 
search problem in the RSM pattern is that the variables 
and response of interest be quantitative. A second 
requirement is that a supply of relatively stable ex- 
perimental material bo available. Third, one must be 

\ 

woidiing within an experimental procedure which v;ill permit 
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rather strict control of irrelevent variables aiid will 
yield a relatively small standard error. Finally, a 
research area in which a number of relevant variables 
had been identified would permit an examination of the 
methodology in a truly complex setting. The topic of 
concept learning seemed to fit this bill of particulars. 

Revi evj of Concept Learning 

Interest in the study of the psychological process 
of conceptualization has been of concern to psycholo- 
gists since the beginning of the century under such 
names as generalizing abstraction (Fisher, 1916), con- 
cept formation (Hull, 1920; Smoke, 1932), concept at- 
tainment (Heidbreder, 19^4-6), concept learning (Hovland, 
1952 ), and concept identification (Archer ^ > 1955) 

Examination of the experimental tasks required of sub- 
jects by all of these investigations reveals sufficient 
commonality to conclude that all were interested in 
aspects of a single psychological process which will be 
referred to as concept learning. 

The chronological trail from the early v;ork of 
Moore (1910) and Fisher (1916), who relied exclusively 
on introspective reports of subjects for data, to the 
first attempts at quantifying concept learning by PIull 
(1920), to the relatively recent work of Bruner, Goodnow, 

I 

and Austin (19 5^) reveals two interesting trends. The 



most obvious one is that of moving towards greater ob- 
jectivity by repeated modification of the experimental 
task in the direction of removing equivocal sources of 
variability. The second trend, which is actually a re- 
sult of the first, is that of the experimental task be- 
coming a less and less adequate analog of what a class- 
room teacher would consider the learning of a concept. 

The subject is being asked to solve a finite problem 
rather than add a new concept to his cognitive reper- 
toire. 

The early experiments of Fisher, Hull, and Smoke 
required the learning of arbitrary concepts which were 
characterized by having a novel, nonsense name and by 
the possibility of their being represented by an in- 
finite number of percepually differentiable exemplars. 

By contrast, examination of the recent work of Bruner 
et al. , which has set the pattern for much of the ex- 
perimental work of the past dozen years, leads to the 
characterization of the concepts learned as having a name 
familiar to the subjects, being applicable to a finite 
set of stimuli and having little or no perraanence. In 
the introductory remarks in Chapter One a case v;as made 
that tasks presently being used in concept learning stud- 
ies were inadequate analogs of tasks presented children 
by teachers under the rubric of learning concepts. The 
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points of inadequacy were the finiteness of the universe 
of exemplars of the concept, the use of familiar vocabu- 
lary in familiar ways to name the concept, the absence 
of any expectation that the learning will persist and 
the often systematic way in which stimuli are presented. 

The chronicle of this transition from tasks which 
were quite adequate analogs of real life concept learn- 
ing to present tasks which are more closely analogous to 
the solving of a problem is long, devious and obscure. 

The obscurity of the trend is, in no small measure, at- 
tributable to a tendency on the part of researchers to 
avoid defining ambiguous terms. Perhaps a single example 
from the very influential A Study of Thinking (Bruner et al. , 
1956 ) will demonstrate what seems to have happened in the 
literature on this topic. 

Bruner et_ aL develop their idea of how concept 
attainment takes place by describing the manner in which 
a hypothetical foreigner might, over a period of time, 
by using information about individuals that he meets, 
develop and attain a concept of what an "influential” 
person is. The process described is one of meeting in- 
dividuals, noting characteristics of the individual and 
receiving information as to how influential the person 
is. Bruner ^ al . maintain that by a variety of strate- 
gies this information is processed and results in a concept. 



in this case that of an influential person. This dis- 
cussion of the theory underlying the attainment of a 
concept is concluded by an unsubstantiated statement, 
**One can specify the minimum array of instances nec- 
essary in order for our hypothetical foreigner to solve 
the problem of who is influential.” (Bruner et al., 
1956» P» 65 )• No evidence is presented that such an 
array can be specified and there is no recognition by 
the authors that a very basic assumption has been made. 
This statement implies that the list of relevant at- 
tributes of the concept, influential, is both finite and 
determinate. If one can accept this assumption, then 
the use of finite arrays of geometric figures as the 
stimulus material for concept learning tasks can be 
accepted. However, this writer cannot accept this 
assumption and, therefore, cannot accept ’’blue circles” 
as an equivalent type of cognitive thing to ’’influential 
person.” Yet this obscure statement does serve as a 
connecting link between the theoretical and the empiri- 
cal portions of the Bruner book. 

The experimental papers which have appeared in the 
educational and psychological journals during the past 
fifteen years have, by and large, appeared to make the 
same assumption. Article after article (e.g.. Hunt and 
Hovland, I960; Glanzer, Huttonlocher, and Clark, 1963; 
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Katos and Yudin, 1964) purports to be studying some 
aspect of concept learning with the only justification 
for the product of the experimental task being called 
a concept being that others have used similar tasks to 
study concepts. The point of this discussion is not 
that the empirical evidence which has been accumulating 
has no relevance to the problem of how one learns a 
concept, but, rather, that we may have considerable dif- 
ficulty transferring these findings to the classroom 
because certain important aspects of the concept learn- 
ing task have been omitted. 

The experimental procedures and tasks to be des- 
cribed later in this chapter have been adopted, as far 
as possible, to parallel the type of task used in recent 
empirical studies of the problem while, at the some 
time, meeting the objections to those tasks which have 
been raised here and in Chapter One. 

Klausmeier, Davis, Ramsay, Fredricks and Davies 
(1965) have compiled a rather complete bibliography 
of the concept learning and problem solving literature. 
Included is a taxonomy of vaiables seen as relevant to 
the concept learning process. Of this list of over one 
hundred variables, the following eight were selected for 
incD.usion in the sequence of experiments: 

1) amount of redundant information. 
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2) mode of presentation of successive instances, 

3) ratio of positive and negative instances, 

ii-) order of positive and negative instances, 

5) amount of information in accompanying verbal cues, 

6) length of time instance available to subject, 

7) length of time between instances, and 

8) relative complexity of concept. 

The Experimental Design 

The usual procedure in the design of an experi- 
ment is for the researcher to identify the problem area 
of interest, to formulate specific questions to be an- 
swered or hypotheses to be tested, to define the treat- 
ments to be administered, to select the population to 
be studied and then to build these factors into am 
appropriate experimental design. The problem determines 
the methodology. 

In this situation, however, the attempt to demon- 
strate the practicality of the method being used was 
as important as the knowledge which might be gained 
about the manner in which children learn concepts. The 
order of priority was reversed and the requirements of 
the design dictated, within limits, the variables to be 
studied, the nature of the experimental task, the method 
of presentation, and the population to be studied. 
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Among the major difficulties to bo faced in apply- 
ing the RS model to educational or psychological prob- 
lems is the paucity of important input variables that 
meet the criteria of being continuous, having at least 
an interval scale metric and being capable of exact 
measurement and control. What is needed are approxima- 
tions to the previously described approaches which will 
accoimaodate discrete, ordinal scale variables. One such 
approximation, a modification of the approach suggested 
by Spendley, Hext, and Himsworth (1962) for the sequential 
use of simplex designs in optimization processes was 
followed in this study. 

The essential difference betx^een the design used by 
Spendley ^ al . and the modification used here is that 
they used fractional valued combinations of the various 
independent factors resulting in sets of points which 
formed the vertices of a regular simplex with edges of 
unit length, while in the modification proposed here 
one is forced to use irregular simplexes with integer 
valued vertices. For example, in a two factor situation, 
Spendley v/ould begin vjith tho regular simplex with ver- 
tices at tho points (0.00, 0.00), (0.97 t 0.26), and 
(0.26, 0.97 )f as contrasted to the irregular simplex 
vJith vertices at (0,0), (1,0) and (0,1) necessitated by 
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of (k+1) experimental combinations, where k is the num- 
. ber of independent variables under investigation. Rola 
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where the k rows are the k independent variables and 
the (k+1) columns are the experimental combinations at 
the (k+1) points. These define a simplex in k dim- 
ensions, a simplex which forms one of the (k+1) faces of 
the k-dimensional simplex design. 



The general experimental plan calls for the run- 
ning of experiraonts at each of the experimental combina- 
tions represented by the columns of D, collecting response 
measures at each of the combinations, and then makii^g 
decisions leading to more optimal experimental combina- 
tions based upon the data which have been collected from 
the first (k+1) experiments. The rules needed for these 
decisions, specifically concerned vjith v/lion to move, in 
what direction to move and hov/ to calculate the new com- 



binations follovj those described in Chapter Two (pages 



lll“3 ) . 
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1. One moves, i.e., changes, the experimental 
combinations for each new trial after the 
basic (k+1) combinations have been run. 

2. One moves away from the combination resulting 
in the least optimal response by discarding 
that combination from the simplex and re- 
placing it by a new point on the opposite 
side of the remaining face. 

3. Following the suggestion of Spendloy, the 

% 

procedure for calculating new combinations 
was as follov/s. For each variable, choose 
that integer valued level which is closest 
in numerical value to twice the mean level 
of the variable in the k combinations being 
retained minus the level of the variable for 
the point being discarded. 

The aim of this calculation is to produce a new 
simplex in the variable space which will be a mirror 
image of the first. With discrete valued variables, 
one is forced to approximate fractional values with 
the nearest integer valued point. This results in 
each set of (k+1) points defining a simplex of somewhat 
different shape. This difficulty is not immediately ap- 
parent in a two dimensional problem because all mirror 
images, of integer valued triangles v/ill also have integer 
valued coordinates. 
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In three dimonsione this is not necessarily true. 
The mirror image of tho simplex, D^, formed by reject- 
ing the point (0,0,0) is with the new point, 

(2/3, 2/3, 2/3). 
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The closest integer valued point to (2/3, 2/3, 2/3) is the 
point (1, 1, 1) which results in a simplex, , of slight- 



ly different size and shape than that defined by the ori- 
ginal set of points. Furthermore, the amount of distor- 
tion that occurs from this substitution of an integer 
valued point for a fractional valued one is not constant 
and becomes potentially greater as the dimensionality of 
the system increases, ’’potentially” must be stressef^ as 
the distortion which actually occ\irs will be a function 
of the particular sequence of simplex designs which the 
data collected dictates. 

Descrip tion of the Variables Studi ed 

For the purposes of this study the eight variables 
included in the experiment were defined as follows. 

Amount of redundant information in tho typical con- 
cept learning experiment refers to that information in 
excess of some minimal amount which the subject r©Q.uiros 
to learn the concept. The ability to specify this 
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minimal amount of information nooded to learn the con- 
cept is, however, a consequence of the complete spec- 
ificity of the set of entities used as exemplars or 
non -exemplars of the concept. In the real world, the 
number of discriminable exemplars or non -exemplars of 
a concept is always indeterminate and one has no way of 
knowing whether or not the correct categorization of some 
future instance will require attending to some attribute 
which to that time has been deemed irrelevant. This is 
equivalent to saying that real concepts are never com- 
plete and, thus, additional exemplars of a concept are 
never completely redundant in the information sense. 

Since it was the intention of the experimenter to build 
this indot ©rminateness into the sets of instances u/?ed, 
the amount of red^mdant information was taken to bo 
relative and assumed proportional to the number of dis- 
criminable instances presented. Thus, a set of six in- 
stances was said to have more redundant information than 
a set of three even though the absolute amounts of in- 
formation needed could not bo specified. 

Mode of presontati on of s ucoessiv© instances refers 
to the context in vihich instances are presented. They 
may be presented in isolation, in conjunction with the 
first exemplar or positive instance, in conjunction with 
all previously presented exemplars or within the total 
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array of availablo instances • Note that these may bo 
ordered in terms of the amount of prior information 
about the concept that is presented with each successive 
instance. 

The ratio of posi tive and neg ative instances is 
a particularly interesting variable in that experimental 
findings point out that negative information is particu- 
larly difficult for some people to process (see Hovland 
and Wiese, 1953 )> while a logical analysis of the inde- 
terminate real world v/ould indicate that some negative 
information is necessary if one is to categorize that 
world in meaningful ways. 

The order of positive and negative ins tances has 
received little attention in the literature. Nonetheless, 
it seems reasonable that it should make a differenco 
whether instances are presented with all the positive 
instances first, with all the negative instances first 
or in some interspersed order. 

This vari able was included primarily because it 
provided added complexity for the testing of the simplex 
design with little added effort in the construction of 
treatment materials. 

The use of verbal cues with varying amou nts of 
information was suggested by the vjork of Wittrock ^ . 




( 1964 )•. The cues used in this study wore verbal state- 
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merits emphasizing certain aspects of the visual instance. 
While no information was presented that was not con- 
tained in the visual presentation, the higher level ver- 

V 

bal cues operate to focus subject's attention on relevant 
or defining attributes of the concept. 

Length of tirao instance was available to subject 
and length of time between instances were variables which 
were included primarily because of the ease with which 
they could be manipulated. Their primary function was 
to complicate the design and provide a more strenuous 
test of the simplex design and response surface model. 

With finite sets of instances there is some indica- 
tion that more complex concepts, i.e., those with more 
relevant dimensions, are more difficult to learn. As 
with the amount of redundant information variable, tho 
specification of the complexity of a real world concept 
in any absolute sense is impossible and this variable 
makes sense only inasmuch as one considers the rel ativo 
complexity among concepts. This is possible with the 
several concepts used in this study. 

Experimental Task and Materials 

In the introductory remarks in Chapter I, dissatis- 
faction was expressed concerning the artificiality of 
many experimental analogs of concept learning behavior 
and specific comments were made about three specific 
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doficienoies which were seen to exist in certain lab- 
oratory paradigms which are widely used in concept learn- 
ing studies. These paradigms v/ere seen to be inadequate 
representations of what educators would call a concept 
learning task in that the tasks studied are usually: 

1) lacking in any requirement that the individual 
subject reorganize his environment in any novel 
way, 

2) couched in vocabulary that is quite familiar to 
the subjects, and 

3) presented in the context of a finite system of 
attributes having a quality of unambiguity not 
present in the real v/orld. 

Furthermore, it was suggested that the phenomena of 
interest was actually a somewhat permanent change in 
cognitive behavior and thus an outcome measure of reten- 
tion rather than one of immediate learning was desirable. 

However, in spite of these difficulties with much 
previous experimental work in the area of concept learn- 
ing, it is true that much, if not most, of the empirical 
evidence that psychologists have collected about concept 
learning has been in the context of such experimental pro- 
cedures. To make use of this wealth of experimental work 
requires that one use an experimental method vjhich is 
similar enough to what has gone before to mai^:e use of that 
which others have learned. 
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The dovelopmont of the specific set of concept 
learning tasks used in this study represent the sim- 
ultaneous effort to satisfy several somewhat conflicting 
criteria. The tasks had to be reasonable analogies of 
the "learn the rule by which this collection of figures 
has boon dichotomized" type of experiment. In addition 
they had to overcome the objections about the artifici- 
ality of these tasks which have been enumerated. Fin- 
ally, the tasks or experimental treatments had to lend 
themselves to inclusion in the simplex design approach 
to a response surface model as has been described and to 
the study of the eight independent variables listed above 
Since the need to overcome the artificiality present 
in most card-sorting or concept-board type of concept 
learning tasks was seen as the most important of these 
criteria, a decision was made to choose tasks which would 
consist of attempting to learn things which are typi- 
cally taught in schools and about which there would be 
little disagreement as to their being correctly termed 
concepts. A major difficulty with this approach is the 
problem of getting a large supply of naive subjects — 
subjects who would enter the experimental situation know- 
ing nothing of the concepts to be presented. One method 
of handling this difficulty would be to present concepts 
usually taught at one grade level to children at some 
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earlior grade level. This, in turn, requires that the 
concepts be ones that younger children are development- 
ally ready to handle and that are primarily school- 
taught concepts rather than the type the child could 
possibly have been introduced to via radio, television, 
newspapers, motion pictures or adult conversation. 

The requirement that the tasks sufficiently re- 
semble the typical card-sorting or concept-board type of 
task so that analogies could be drawn between the two 
meant that the set of concepts to be learned had to be 
such that it would be possible to specify the relevant 
attributes and the values of said attributes which de- 
fine the concepts. Moreover, as one varies the attri- 
butes or values of the attributes, it should be true 
that one is creating exemplars of different concepts. 

The third criterion is that the materials used had 
to fit the simplex design being studied and had to adapt 
to the specific set of variables chosen for inclusion 
in the design. Since this design calls for the deter- 
mination of the (k+2)’^^ and succeeding experimental 
treatments only after the first (k+1) experiments have 
been run and since the total number of possible combina- 
tions that might be dictated by the basic set of com- 
binations easily rims into the thousands,*^' the tasks 



'^Soven factors with five levels of each results in 
a complete factorial experiment with 78jl2i> experimental 
combinations. 



chosen had to bo such that a new experimental treatment 
combination could be put together on relatively short 
notice, in some cases overnight. 

The first solution that suggested itself was the 
use of variations of colors, numbers and shapes of geo- 
metric forms that have been used in many concept experi- 
ments with the difference that these materials bo used 
with children of a sufficiently young age that the child- 
ren would indeed need to be learning a label for the 
concept and organizing their environment in novel ways 
while they wore learning the concept. This, however, 
would have necessitated the use of pre-school or kinder- 
garten age children and forced an. individually administrated 
type task. This approach was eliminated on the grounds 
that it was not practical with the large numbers that 
seemed called for to provide sufficient precision in the 
estimation of responses. Another alternative considered 
was the use of a set of political concepts such as com- 
munism, socialism, democracy, dictatorship, etc. but this 
was rejected because of the difficulties in preparing a 
wide variety of exemplars of the concepts, in being able 
to identify precisely the defining attributes and in find- 
ing a population of students that was development ally 
ready to work with concepts of this kind but which had 
not already had widely varying exposures to these concepts 
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via outside-of-school sources of information. Similarly, 
sets of language concepts and science concepts were re- 
jected and the focus came back to mathematics, specif- 
ically geometry, in which concepts would lend themselves 
to visual presentations which could be carefully con- 
trolled. 

The plane geometry concepts of trapezoid, isosocles 
triangle, quadrilateral, and rectangle were finally chosen 
to bo learned by fifth- grade -age youngsters. These are 
concepts which have typically been tau^t in the sixth, 
seventh, and eighth grades which satisfies the criterion 
of being real concepts that schools attempt to teach. 
Furthermore, the newer, so-called modern math, approaches 
to teaching elementary school mathematics have pushed the 
first introduction of these concepts down into the third, 
fourth fnd fifth grades giving support to the contention 
that children of this age are ready, in a developmental 
sense, to acquire these concepts. In addition, these 
are concepts to which children do not get any appreciable 
exposure outside of school instruction. Finally, in the 
spring of 1965 there still remained numerous school systems 
in southwestern Wisconsin that had not made the transi- 
tion to arithmetic series which incorporated the modern 
math approach and thus i^rovided a large potential popula- 
tion of fourth and fifth graders for this study. Thus, 



the criterion of being real concepts for which a large 
number of naive subjects could be found was satisfied* 

A second criterion was that the relevant dimensions 
of the concepts be clearly specifiable and that vary- 
ing the relevant attributes or the values of relevant 
attributes produce new an,d meaningful concepts so that 
close analogies could be made to the great bulk of con- 
cept learning studies which have been reported over the 
past fifteen years. In plane geometry we have a system 
which is quite rigorously and unambiguously defined* 

For example, all plane figures may be categorized as 
open or closed, all closed figures as polygons, non- 
polygons or composites of the two, and all polygons into 
a variety of categories or concepts depending upon the 
nuTiiber of sides, the size of angles and the spatial re- 
lationships among sides and angles* Thus, within the 
context of a system of piano figures, one has a universe 
of concepts which, for practical purposes, is defined by 
a finite set of attributes or dimensions, each with a 
finite set of values* Yet, there exists an infinite 
number of perceptually different exemplars and non- 
exemplars of each concept* This infinity of exemplars 
is the result of the system including a number of ir- 
relevant attributes, some of which may take on an in- 
finity of values* Two of these are the size and spatial 
orientation dimensions* 
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while it makes sense to speak of an elemento.ry 
school age child having a concept of a geometric figure 
such as a triangle, note that this is a relatively low 
level or unsophisticated concept of triangle compared 
to that of a high school geometry student or of a pro- 
fessional surveyor or of a mathematician specializing 
in geometry. Whether these differences should be ex- 
plained as a series of steps in the complete learning 
of a concept or as a set of distinct concepts with a 
single label will not be argued here. It does, however, 
emphasize the problem of the relativeness and indeter- 
minateness of that which educators would call concepts. 

A third criterion had to do with the practical 
aspects of using these geometric concepts within the 
framework of the response surface exploration model of 
a sequence of experiments with the treatment combina- 
tion at any point in the sequence being determined by 
responses to preceding experiments. By presenting the 
exemplars of the concepts by means of slide transpar- 
encies, a wide variety of treatment combinations was 
possible from a relatively small pool of photographic 
material by superimposing two or more transparencies 
in the same slide mount. Certain of the other variables 
included had to do v;ith auditory cues tind varying times 
which was controlled by a tape recorder which presented 
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verbal cues while regulating the pacing of slide pre- 
sentation. 

The concepts chosen to be used in the experiment 
were those of square and polygon, which were used as 
warm-up tasks, and quadrilateral, isosocles triangle, 
trapezoid, and rectangle which were presented as con- 
cepts to be learned and tested upon. 

Of the eight variables designated to be varied 
during the set of experiments, seven were varied from 
experiment to experiment and the eighth, that relating 
to the relative complexity of the several concepts, 
within each experiment. At least five possible levels 
of the first seven factors were identified and three 
levels of the eighth. The factors and levels of each 
are presented in Table 2. Since all of the factors 
or variables were assumed to be at least ordinal in 
nature, they were coded nvimerically as -1, 0, 1, 2, 3. 

This choice of code was preferred to a 1, 2, 3, $ code 

because it allowed the original simplex to be oriented 
about the combination, (0,0,0,0,0,0,0 ) , and permitted 
movement of the simplex design in all directions from 
the original set rather than only in a positive direction. 

The actual choice of levels used is arbitrary and 
was based solely upon the Judgment of the experimenter. 
Where empirical data is available it, of course, should 
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be given careful consideration in the making of these 
choices. In this situation, the existing empirical 
evidence was only tangentially related to the type of 
presentation being made. The decisions were based upon 
examination of the introduction of these concepts in 
several arithmetic textbook series, consultation with 
mathematics curriculum people, experience in analyzing 
data from concept attainment experiments performed at 
the Learning Laboratory of the Educational Psychology 
Department at the University of Wisconsin, and considera- 
tion of the length of time one could hold the attention 
of a group of fifth grade children. Following the sug- 
gestions of Spendley ejb al . the levels were chosen so 
that a unit change on any one could reasonably bo ex- 
pected to result in a measureable shift in the response 
measure. 

Thus, on the redundancy of information factor 
increments of three instances seemed adequate to pro- 
duce measureable changes, if indeed this variable was 
important to the learning of the concepts. By using 
six instances as the zero level, the possibility of mov- 
ing the simplex in a negative direction, i.e., to pre- 
sentation of sets of three, two, or one instance was 
allowed. 

Tho second factor is mode of presentation. Five 
modes that have been used in concept learning studies 
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formed the five levels of this factor. Theoretically, 
these five modes convey varying amounts of information 
to the subject and on this basis they may be ordered. 

That mode which contains the least information is the 
presentation of instances one at a time with no oppor- 
tunity to refer to instances previously presented. A 
more informative mode of presentation is to identify 
the first positive instance of the concept as a focus 
instance and then present new instances in conjunction 
with this focus instance, thus providing a referent for 
the new instances. A further step is to present each 
successive instance in conjunction with all previously 
presented instances. A fourth level is to present a 
sufficient set of instances for the learning of tho con 
cept and to identify successive instances within tiu) 
context of this set. A fifth level is the presenta- 
tion of instances within the context of a much larger 
set than that identified as a sufficient set. In the 
laboratory type problem v/ith a concept board this array 
is typically the entire universe of allovjable instances. 
With a real world concept it is not possible to iden- 
tify either a sufficient set or a complete set of ex- 
emplars of any one concept. To approximate the fourth 
mode of presentation a sufficient set of instances needed 
to define each concept, given that the instances were 
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plane figures, was devised and a larger display con- 
sisting of several of these pseudo -sufficient sets was 
used for level five. Sketches of the sets of figures 
used for the concepts trapezoid, isoceles triangle, 
quadrilateral and rectangle are presented in Appendix A. 

The third factor is that of the ratio of positive 
to negative instances of the concept being learned. The 
five levels chosen were those of all positive instances, 
all negative with the exception of a single positive 
focus instance and three intermediate ratios of positive 
to negative, 2:1, 1:1, and 1:2. Those levels were or- 
dered on the basis of number of positive instances. 

The fourth independent variable had only four levels. 
Pour orders in which instances can be presented are to 
present the positive instances first, followed by the 
negative ones, the negative first followed by the posi- 
tive or an interspersed arrangement where the order is 
either random or deliberate alternating of positive and 
negative. The ordering of these four levels was done 
by examining them in sets of three and asking which 
seemed to be an intermediate betv;een the other two in 
terms of expected response. The difficulties with this 
ordering became apparent early in the experimental part 
of the study and the factor was eliminated in the con- 
traction of the design after the basic set of experi- 
mental conditions were run. 
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Tho five levels of verbal cue to be used consisted 
of silence, the statement ’’This is (is not) a ...” 
followed by tho appropriate name of the concept, the 
above statement with a one-word cue ("count” for the 
quadrilateral, "equal” for the isosceles triangle, 
"opposite" for the trapezoid and "corners" for the rec- 
tangle), the above statement followed by a stronger cue 
("count sides" for the quadrilateral, "equal sides" for 
the trapezoid and "square corners" for the rectangle), 
smd a complete definition of the concept. 

The two time factors, time each slide was pro- 
jected, and time between slide presentation of suc- 
cessive instances, were measured in seconds with slides 
projected for multiples of four seconds and the time 
between slides being multiples of two seconds. Con- 
sideration was given to the idea that times used might 
more closely represent an equal interval scale in terms 
of psychological impact if the levels were chosen as 
some exponential or geometric progression rather than 
an arithmetic one but this was rejected because it 
would have led to longer experimental sessions with the 
subjects than seemed practical for children of their age. 

The eighth variable studied, the relative complexity 
of the concept, differs from the others in that it is 
not related to presentation of the task as are the first 
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seven. Instead it represents a change in task. Vari- 
ation of this factor could have been carried out be- 
tween experimental treatments by using only one concept 
per experiment or by using several of the same theor- 
eticc.1 complexity, but it was believed that more could 
be learned about this factor by varying it within each 
experiment. As this meant multiple responses to any one 
experimental treatment, one level was chosen as the 
level of complexity to be used as the criterion response 
for the attempt to optimize on the other factors. 

Since all the concepts used in this study are 
special cases of the concept, polygon, it is sufficient 
to specify the complexity of polygon as k and then specify 
the complexity of the several concepts used relative to 
k. Thus, a triangle is a polygon with a single added 
restriction as to the number of sides and therefore may 
be expressed as having complexity (k+1). An isosceles 
triangle is a triangle with the restriction that two 
sides (or two angles) be equal and therefore is of com- 
plexity (k+2). Similarly a quadrilateral is a four- 
sided polygon, complexity (k+1); a trapezoid is a quad- 
rilateral with two parallel sides, complexity (k+2); 
and a rectangle is a trapezoid with right angled corners, 
complexity (k+3). Thus, if the complexity of a concept 
is important to the learning of that concept, quadri- 
lateral should be easier to learn than either isosceles 
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triangle or trapezoid and all three should be easier 
to learn than rectangle. 

The Experimental Procedure 

The actual experimental treatment administered to 
any given group of subjects consisted of a short twenty 
to forty minute presentation. This presentation began 
with an informal statement by the experimenter"^' in 
which he introduced himself > briefly explained the con- 
nection with the University of Wisconsin, made some 
general comments about the weather, school, etc., to 
relax the children, and explained, generally, the pur- 
pose of the experiment was that of trying to find out 
some things about the ways in which youngsters learned. 

A short explanation of the synchronized slide and tape 
recording equipment was given to satisfy the children’s 
curiosity as to how the machinery operated and thus 
have greater attention given to the actual presentation. 
With a final exhortation to pay close attention to the 
slides and tapes so as to learn as much as possib3.e, 
the lights were turned off and the treatment presented. 
This introductory session was deliberately kept informal 
and somewhat uncontrolled in an attempt to get the 
youngsters into a relaxed frame of mind. 

'"'All experiments were conducted by the author or 
Mary Davis of the staff of the Center for Learning and 
Re-Education at the University of Wisconsin. 



This is a TRIANGIiE 




FIGURE 2 



Focus instanco slid© for concept, trianglo* 
succession of instances used to demonstrate 



and the 
the concept. 



The slide and tape presentation (the complete 
transcript appears in Appendix B) began with a set of 
instructions specifying that the purpose of the task 
being presented was to discover what there was about 
certain types of geometric figures which resulted in 
their having a common name. This was followed by a 
practice sot of five slides, all of which were positive 
exemplars of the concept triangle. Figure 2 shows the 
focus instance (the first positive exemplar) as it 
appeared on the screen and sketches of the other four 
exemplars used. Note that with triangle, as with all 
sets of instances, the attempt was made to present as 
wide as possible variety of triangles. Obtuse and 
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acute triangles, scalene, isosceles and equilateral 
triangles, triangles of various sizes and spatial ori- 
entation^ and triangles as either solid or outline fig- 
ures with lines of differeing widths vjere used. With 
each slide presented, the title, ’’This is a TRIANGLE” 
was simultaneously read via the tape recording. At the 
end of the sequence the children were asked if they now 
knew what made a figure a triangle. After a pause they 
were told that a triangle was any throe sided figure. 

A second practice set designed to elicit the con- 
cept, Square, was then presented. This practice set 
differed from the first in that it contained negative 
as well as positive instances of the concept. An ex- 
ample of one of the negative instances as well as the 
complete set of nine slides is presented in Figure 3 « 
Again the youngsters were asked if they now understood 
what a square was and were then told that it was a fig- 
ure with sides of equal length and square corners. It 
should bo noted that toward the end of this practice 
sequence the children tended to become restless over the 
seeming triviality of the task. However, the appearance 
of the rectangle (which was not too different from a 
square) as a negative instance apparently forced many 
to focus more sharply on the materials being presented 
and emphasized the non- trivial! ty of the task. 
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This is not a SQUARE 




FIGURE 3 

Nogative instance of concept, square, and the 
series of instances used to demonstrate the concept. 

The children were then told that the following 
figures would be much more difficult and then, in this 
order, were presented with sets of slides exemplifying 
the concepts of quadrilateral, isosceles triangle, trape- 
zoid, and rectangle. These sets were systematically 
varied on the independent variables previously described. 
After each set the lights were tui'ned on for one minute 
and the students were asked to perform an activity de- 
signed to reinforce what they had learned from the pre- 
sentation. After the quadrilateral and trapezoid sets 
they were asked to try to write a definition of what they 
thought a quadrilateral or trapezoid v/as and after the 
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other two they were requested to try to draw two dif- 
ferent examples of isosceles triangle or rectangle. 

At the completion of the rectangle concept set 
the youngsters wore told that the experimenter would 
be back the following day to see hew much they had 
learned from watching the slides. It was suggested 
that it was what had been learned from the slides that 
was of interest, not what they could learn from their 
teacher, each other, their parents, books, or the other 
students who had not seen the slides. In the light 
of this interest they were requested to not talk about 
the experiment until after the experimenter returned, 
at which time they would receive a more complete ex- 
planation of what the experimenter was trying to ac- 
complish. The following day, at approximately the same 
time of day, a return visit was made to the school and 
the response measure, a sixty- item test, v/as administered 
to both the control and experimental groups. Following 
the test, the experimenter defined the four concepts to 
the youngsters, explained how one oould use the positive 
and negative instances to determine the definition and 
answered questions about the procedure. 

The Response Measure 

In the concept-board or card-sorting type concept 
learning experiment, the response measure has typically 
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1. circle each figure below which is a QUADRILATERAL, 
Do nothing to the other figures. 




FIGURE 4 

Item number 1 of the Geometric Figure Identification 
Test used to test for the concept, quadrilateral. 



been some concomitant of immediate learning. The sub- 
ject is expected to verbalize the rule which defines 
the concept, i.e., divides instances into exemplars and 
non-exemplars of the concept, or to do some task, such 
as sort cards, from which it may be inferred that the 
subject has acquired the rule. This dichotomy of re- 
sponses is often converted into scores by considering 
some concomitant of success or failure, such as the 
number of trials needed, the amount of potential infor- 
mation received, the time required, or the nur.iber of 




errors made. Typically, no attempt has been made to 
measure retention of the concept. This, of course, is 
not surprising since the artificiality of these tasks 
do not lend themselves to such a procedure. 

In this set of exporimonts inunediate learning was 
not measured nor were any concomitants of that learning. 
The criterion test, the Geometric Figure Identification 
Test, was given approximately twenty-four hours after 
the running of the experiment. This test, which ap- 
pears in Appendix C, contained four items. Each of 
these items consisted of a set of fifteen geometric 
figures, some of which exemplified the concept being 
tested. The directions were to circle those figures 
which were exemplars of the concept. Figure I4. is the 
item for testing the concept, quadrilateral. 

The Experimental Population 

Several criteria had to be satisfied in choosing 
the population from which experimental units were to 
be chosen. The design called for a sequence of experi- 
ments, between fifteen and thirty in number, requiring 
a large supply of relatively homogeneous experimental 
material. As individual children are quite hetero- 
geneous with respect to cognitive abilities, the use 
of groups of children rather than individual children 
as the units was indicated. A tryout of the criterion 
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test revealed that groups of approximately twenty chil- 
dren vjould reduce the standard deviation among group 
means to approximately one-half unit on the response 
measure. This implied the need for either a pool of 
300 to 600 subjects from which groups could be randomly 
drawn or a pool of l5 to 30 schools which could be ran- 
domly assigned to treatments. 

Another restriction was placed on the choice of 
population by the experimental task. Subjects or schools 
were needed which would provide naive subjects as re- 
gards the four geometric concepts being used. Specif- 
ically, this eliminated school systems which had adopted 
arithmetic tests which introduced these concepts in the 

middle and upper elementary grades. 

A third criterion that had to be satisfied was the 
practical one of having the students in a sufficiently 
compact geographical area to permit the running of the 

experiments with reasonable costs. 

The two large metropolitan school systems within 
feasible geographic limits were both rejected because 
both woi% using arithmetic text series which introduced 
the concepts being used before the sixth grade. Had 
this not been the case, the altornativo of running the 
experiment within a single largo system would still have 
been rejected. The transporting of children noc0ssar»y 
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for tho random choice of subjects from a single largo 
pool of subjects was deemed impractical and the het- 
rogeneity ajoiong schools within a single system was thought 
undesirable. 

The alternative chosen was to identify a lai’ge num- 
ber of school districts representing similar communities, 
choose those that fit the criteria and then randomly 
choose schools to provide the experimental group. The 
specific criteria used follows: 

1. Neither tho arithmetic series being used nor 
tho experience of the state mathematics super- 
visor indicated that fifth grado children 
would have had any instruction on the concepts 
being studied. 

2. The school was the only elementary school 
serving the town and surrounding area in which 
the school was located. 

3. The school was large enough to have at least 
tv/o rooms (approximately I 4 .O children) of fifth 
grade students but no more than three. 

4 . The school was located within ninety minutes' 
driving time of the University of V/isconsin. 

The first of these criteria aimed at providing naive 
subjects. The second provided some assurance that tho 
school populations vjould be relatively similar from 
school to school since each school v;ould serve an entire 
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community. The third criterion made it possible to 
select an experimental group of twenty children and still 
have an equal number to use as a control group so that 
the similarity of the several schools could be tested. 

The upper limit on the size of the fifth grade reduced 
the possibility of drawing a random sample that was un- 
representative of that fifth grades as well as setting 
limits which guaranteed communities within a specific 
size range. The fourth criterion was a practical nec- 
essity in carrying out the logistics of the experiment. 

Thirty schools were identified that met the above 
criteria.*”’ These schools were then put in random order. 
Letters (see Appendix D) were sent to the administra- 
tors of the first twenty schools on the random list 
eliciting their participation in the project. Eighteen 
of the twenty responded favorably, and a nineteenth in- 
dicated that he felt pressured to refuse only duo to 
teacher problems at the fifth grade level. The twentieth 
indicated that he thought research of this type an un- 
warranted imposition on public scnools. As time only 
permitted the completion of eighteen experiments the 
remaining ten schools were never contacted. A complete 
list of the cooperating school systems is found in 



*«Two exceptions were made to criterion 2 in order 
to obtain a sample of 30 • These were in two 2-school 
communities v/here the differences between the schoo3.s and 
the area served by the schools were judged minimal. 
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Appendix E. 

This chapter presented the details and rationale 
for the development of the experimental design, the 
choice of variables to be studied and specifics about 
the levels of the several variables, the selection of 
the experimental task, the development of the experi- 
mental treatment, and the choice of subjects. While 
this chapter has presented those topics as discrete 
entities, this is not how they are developed. Rather, 
the many decisions to be made were formed through the 
simultaneous refinement of all these various aspects. 



CHAPTER IV 



RESULTS 

This chapter, which contains the analysis and 
summary of the data from the sequence of experiments, 
is divided into two parts. The first examines the 
experimental data as it applies to the purpose of em- 
pirically testing the use of the simplex design in- 
attempting to optimize responses in an educational 
experiment. The second examines the data as it relates 
to the previous body of knowledge in the area of con- 
cept learning. 

The set of independent variables studied and the 
codes used to identify the levels of each are des- 
cribed in Table 2 on page 73 * The sequence of sixteen 
experiments that "were performed are numbered from 1 to 
15, with the subscript a indicating a replication or 
re-run due to difficulties connected with the first 
running of that combination. 

The S implex Design in Optindzing Responses 

The aim of each experiment was to present four 
concepts, of three different complexities, to be 
learned by a group of fifth grade children. Amount 
of learning was measured by a Geometric Figure Iden- 
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tifi cation Test in which each child was presented with 
four sets of fifteen figures from which the child was 
to correctly identify the figures which exemplified 
the concept. Thus, each child received four scores 
which varied from 0 to 15 » oue for each of the concepts 
quadrilateral, isosceles triangle, trapezoid, and rec- 
tangle, As has been described in Chapter III, these 
concepts are of three different theoretical degrees 
of complexity, the first of complexity (k+1), the sec- 
ond and third (k+2) and the fourth (k+3). For the 
purpose of studying the effectiveness of the simplex 
model in optimizing responses, the total score on the 
two (k+2) degree concepts, isosceles triangle and trape 
zoid, was taken as the criterion response. As there 
was no assurance that previous arithmetic instruction 
or ability would be the same from one set of subjects 
to the next, each group was randoiiL),y divided into an 
experimental and a control group and the mean score 
of the experimental group was adjusted by subtracting 
from it the mean score of the corresponding control 
group. This measure, the difference in mean scores 
between experimental and control groups on the isos- 
celes triangle and trdpbzoid iteras, was the response 
used in attempting to arrive at an optimal combination. 

The basic simplex design consisted of the eight 
combinations presented in Table 3* Tvjo Number 5 
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combinations are listed because that particular experi- 
ment was replicated when a teacher made some remarks 
at the conclusion of the experimental treatment which 
might have influenced the control group. As the results 
of the replication indicated that these fears were un- 
founded, the first Number 5 combination was used in the 
analysis. The numbers in the several experimental and 



TABLE 3 

The set of eight treatment combinations 
defining the basic -dimens ionB.1 simplex. 





Variables^'* 


1 


2 


3 


4 


5 


5a 6 


7 


8 


1 


Redundancy of info. 


0 


1 


0 


0 


0 


0 


0 


0 


0 


2 


Type of presentation 


0 


0 


1 


0 


0 


0 


0 


0 


0 


3 


Ratio of + to - 


0 


0 


0 


1 


0 


0 


0 


0 


0 


k 


Order of + and - 


0 


0 


0 


0 


1 


1 


0 


0 


0 


5 


Verbal cue 


0 


0 


0 


0 


0 


0 


1 


0 


0 


6 


Time per slide 


0 


0 


0 


0 


0 


0 


0 


1 


0 


7 


Time between slides 


0 


0 


0 


0 


0 


0 


0 


0 


1 


N 


in Experimental group 


24 24 21 


24 21 


21 


20 


26 24 


N 


in Control group 


43 32 


33 


65 35 


77 


25 33 79 



* Complete code for variables and levels are found in 
Table 2 on page 73* 



•K-sf Experiment 5 was replicated. 



control groups are also presented in Table 3* While 
the original decision was to have twenty children in 

each experimental group, these numbers are between 20 

? 

and 26 as additional experimental subjects were chosen 
to guarantee that absences on the test day would not 
result in less than 20 in a group. 
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The results of these nine experiments are pre- 
sented in Table 4. The mean scores for the experimental 
groups were in the range, 19.33 to 22.85. Those com- 
binations which contained higher levels of verbal cue 



TABLE k 

Summary statistics for the basic set of 
experimental combinations on the combined score 
of the isosceles triangle and trapezoid items. 



Experimental 
Combination E 

Numb er 


c 


Spooled 


(i-c)* 


®E-C 


I 


21.04 


■' " 16.21 


TTS 3 


4TB3 


.92 


2 


21.58 


16.72 


3.1+2 


4.86 


.92 


3 


20.90 


15.97 


3.53 


1+.93 


.99 


k 


20.50 


11+ -99 


3.00 


5.51 


.72 


5 


19.33 


15.60 


3.19 


3.73 


. 88 


5 a 


19.71 


15.62 


3.63 


1+.09 


.89 


6 


22.85 


l 6.)+0 


3.51 


6 . 1+5 


1.05 


7 


21 . 1|.6 


II+.23 


3.1+8 


7.23 


.90 


8 


19.66 


15.84 


3.27 


3.82 


.76 



«*All differences are significant at p<.001. 



(Number 6), redundancy of information (Number 2), and 
time per slide (Number 7) produced the largest absolute 
responses while those which were set at higher levels 
of order of presentation (Number Ssl and 5b) and in- 
» creased time between slides (Number 8) resulted in the 

smallest responses. This was consistent vjith the re- 

ft 

suits in terms of the corrected response, E-S. 

The mean scores of the control groups are in the 

range, l4-*23 to 16. 72. The expected score, under the 
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assumption of no knowledge and independent guesses on 
each item, would be l^.OO. Every control group had 
one or more members with scores of 23 to 26 auggest- 
ing at least partial knowledge of the concepts. This 
is to be expected, particularly on the isosceles tri" 
angle concept to which children could react solely to 
the word "triangle." The result is mean control group 
scores slightly above chance levels. 

Examination of the standard deviations of the 
experimental and control groups revealed no consistent 
pattern that would indicate that the experimental treat- 
ments were operating to either increase or decrease 
the variability among subjects. Therefore, a pooled 
estimate of the standard deviation was calculated for 
each experiment. These pooled standard deviations were 
quite consistent, all being betvieen 3.00 and 3 . 63 . 

The differences between mean experimental and con- 
trol scores ranged from a low of 3.73 for combination 
5 to a high of 7*23 on combination 7* The standard 
errors of the differences in mean scores all fell be- 
tween .72 and 1.05> and all the obtained differences 
between experimental and control group means v;ere at 
least ii-*23 standard errors in size. Translating this 
into tests of null hypotheses of no difference between 
experimental and control group means, all the differences 
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between experimental and control groups are significant 

(p<.001). 

Of this basic set of eight combinations, the mini- 
mal response in terms of difference between means of 
experimental and control groups was from experimental 
combination 5> a difference of 3«73 points. Thus, the 
decision was made to reject this combination for a new 
one. 

Following the rule suggested earlier, i.e., sub- 
tract the coordinates of the discarded point from twice 
the mean of those from the remaining points, resulted 
in the set of values (^, -1, 3^). Round- 

ing these values off to the nearest integer values gives 
combination Number 9 : (0,0,0, -1, 0,0,0 ) . The experi- 
ment run at this combination produced a mean of 21.37 
for the Experimental group and a mean 15.78 for the con- 
trol group. This led to a difference between the means, 
(E-C), of 5«59 which, with a standard error of l.Oiij 
was significant with p<.001. This mean difference was 
greater than that obtained for combinations 1-4 and 8. 
Among the responses from 1-4 end 6-9, that from com- 
bination 8 was low and should have been the next com- 
bination replaced. 

At this point pressures of time forced a reduction 
in the complexity of the experimental treatments. 
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Delays in getting additional photographic materials 
processed, the re-running of combination and post- 
ponements requested by schools had cost over two weeks 
of time which could not be made up because the experi- 
mental period was limited by tho closing of schools in 
late May. A decision was made that more could be 
learned in the period of time remaining by reducing 
the number of variables from seven to four, on the 
basis of their intrinsic interest to the experimenter 
as well as on the results of the ten experiments which 
had been completed, the variables of ratio of positive 
to negative instances, order of positive and negative 
instances, and time between slides were fixed at the 0 
levels of each. Combinations Ij., 5> and 8 were removed 
from the original design leaving combinations 1, 2, 3> 
6, and 7 to define a four -dimensional simplex. This 
reduction of the design from seven to four dimensions 
and its effect on succeeding experiments is given in 
Table 5 * 

Of these five combinations. Number 1 had the low 
response. Recalculating tho next experiment resulted 
in the combination (hththth) vrhich rounded off to 
(1,1, 1,1) and is Number 10 in Table 5* A sequence of 
five additional experiments were run and the design of 
each is given in Table 5 * Combination '11 replaced 2j 
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12 replaced II5 13 replaced 12; lij. replaced 3 ; and 15 
replaced 1^. If time had been available to run a longer 
sequence^ 1 $ would have been the next combination re- 
placed. The final simplex consisted of the combina- 
tions 6, 7, 10, 13, and 15. 

The statistics from these additional combinations 
are given in Table 6. The pooled standard deviations 
for combinations 9 through 15 are of the same approxi- 
mate size as those from the basic set of combinations 
as are the standard errors of the difference in moans. 

The response to Number 11 demonstrates the useful- 
ness of the simplex design in the presence of experi- 
mental error. The difference in mean scores, (B-C), 
is .78, the only difference which is not significant 
(p<.01). In talking to the teachers after this ex- 
periment was completed, it vjas revealed that the teach- 
ers in this school had spent several days teaching 
these concepts earlier in the semester. If this had 
not been revealed by the teachers, it would have been 
obvious from the analysis of the test results. While 
this evidence was sufficient to justify discarding 
that experiment and replicating it on another group of 
students, this decision was not made. Instead the 
result was allowed to stand; the idea being to observe 

to what results the decision rules would lead. The 

, 

response to Number 11 being low it was replaced by 
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TABLE 6 

SUDimary statistics'^ on the isosceles triangle and 
trapezoid items for all experimental combinations. 



Experimental 

Combination 


B 


c 


E-C 


Spooled 




1 


21.014. 


16.21 


4.83 


3.63 


.92 


2 


21.50 


16.72 


4.86 


3.42 


.92 


3 


20.90 


15.97 


4.93 


3.53 


.99 


k 


20.50 


14.99 


5.51 


3.00 


.72 


$ 


19.33 


15.60 


3.73 


3.19 


.88 


5a 


19.71 


15.62 


4.09 


3.63 


.89 


6 


22.85 


16.40 


6.45 


3.51 


1.05 


7 


21.46 


14.23 


7.23 


3.48 


.90 


8 


19.66 


15.84 


3.82 


3.27 


.76 


9 


zy.yi 


15.78 


5.59 


3.26 


1.04 


10 


20.66 


13.15 


7.16 


3.63 


.95 


11 


21.00 


20.22 


.78 


3.59 


1.12 


12 


21.30 


16.46 


4.84 


3.25 


.97 


13 


22.18 


17.05 


5.13 


3.78 


1.01 


Ik 


ZO.li 


17.26 


3.47 


3.88 


1.16 


15 


19.71 


15.79 


3.92 


2.85 


.90 



Maximum score was 30. 
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Niuubor 12 which, it should be observed, is the same as 
Number 2, which 11 had replaced. The difference in mean 
scores, (E-C), was I 4 .. 84 . which was again low and led to 
Number 13 which is identical to combination Number 11. 
Thus, our decision rules led to the same end result as 
common sense, i.e., combination Number 11 should be 
performed on a new group of subjects. This time the 
combination produced a response, (E-C), of 5«13 which 
was not low. 

The final two experiments consisted of replacing 
3 with li|., and, when that proved low, replacing 14 with 
15 which was identical to 3» Fifteen also was low and 
would have been replaced had there been time to con- 
tinue the sequence. In calculating combination 14 the 
values (^g,0,l^,l^) were rounded off to (1,0, 1,1) instead 
of (1,0, 2, 2) because the latter value, in the judgment 
of the experimenter, resulted in too large an increment 
with corresponding distortion of the shape of the sim- 
plex. That experiments 14 and 15 led to this same set 
of calculated values indicates that experimenter * s 
judgment was probably wrong and the l6th combination 
would have been (1,0, 2, 2). 

The five combinations, 1, 2, 3> 6, and 7> v/hich 
formed the basic simplex in the reduced fotir -variable 
system had a moan response (mean of (E-C)) of 5 , 66 , 

The four combinations (omitting 15 which would have 



101 



been replaced) left in the final simplex had a mean re- 
sponse of This represented an increase of 0*83 

over the sequence of six additional experiments. 

While the sequence of experiments was short, five 
in the basic set and six additional ones, several points 
should bo noted. The mean response of the final sim- 
plex was nearly one standard error greater than the 
mean response of the initial simplex. Reducing the 
number of variables from seven to four at the comple- 
tion of experiment 9, demonstrated the manner in which 
the design can be collapsed. That combinations 11 and 
12 and li|. and l5 merely offset one another is evidence 
that the decision rules will work in the presence of 
experimental error. Since none of the eleven experi- 
ments in the reduced design involved more than two 
levels of any one factor, the advantage over a two- level 
factorial design is not readily apparent. However, when 
one considers that one of the experiments, combination 
11, could have been eliminated on the basis of reports 
from the teachers involved and that the next experi- 
ment in the sequence would have been at a third level 
of two of the factors, the potential saving in experi- 
mental effort over a 2^- or 3^ factorial is more obvious. 
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^j 0 Results Applied to Concept Learning 

As was detailed earlier eight variables were iden- 
tified and built into the experimental design, in part 
because of their relevance to concept learning and in 
part because of the ease with which they could be man- 
ipulated in an empirical test of the simplex design 
model for optimizing responses. Sixteen experiments 
were performed with the levels of each of these vari- 
ables determined by the criterion of obtaining the 
maximum possible response over a set of points held 
in a specific configuration relative to each other. 

This criterion does not lend itself to the simultaneous 
investigation of the tenability of specific hypotheses 
about the variables although it seemed reasonable thai. 
much useful information about the relationship between 
concept learning and the independent variables would be 
forthcoming as a by-product of the investigation of 
the simplex design. 

Table 7 presents mean scores on all four concepts 
for both experimental and control groups for all six- 
teen experiments. The differences between those mean 
scores, standard errors for those differences, t-ratios 
and significance levels appear in Table 8 . A general 
picture of the data in Table 7 can be obtained by study- 
ing the modians of the group means vjhich are reported at 
the bottom of the table. In general, both experimental 
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Median 10.59 9.06 10.00 8.26 10.00 7.01 12.02 10.9ii. 



and control groups did boat on the concept, Rectangle, 
with tho moan scores of all 32 groups being above the 
chance expectation of 7»50> with the control groups in 
a range from 9*39 to 12.00 and tho experimental groups 
from 9.52 to 12.95* On the concept. Quadrilateral, 
control groups also did considerably better than chance 
with mean scores in the range 8.53 to 10.13* Mean scores 
for the experimental groups ware between 8.46 and 12.00. 

The tvjo concepts of medium complexity, Isosceles 
Triangle and Trapezoid, proved more difficult for both 
control and experimental groux)s with the controls aver- 
aging only slightly better than expected by chance with 
tho median of the control group means being 7*81 for 
Trapezoid and 8.26 for Isosceles Triangle. These two 
concepts also produced the greatest differences be- 
tween controls and experimentals . On the Trapezoid 
concept the control group moans were between 6.58 and 
9.50 v/hile the experimentals wore between 8.32 and 
11. Oi^-. The corresponding ranges for Isosceles Triangle 
were 6.92 to 10.72 for the control groups and 10.00 
to 12.15 for the experimental groups. 

Table 8 presents this sarae general picture in terms 
of tho significance tests of tho differences betv/een 
means of experimental and control groups. Examination 
of tho table reveals that only four of tho treatment 



coKibinations suocooded in producing a significant in- 
crease in the learning of the concept, Rectangle, while 
all except two treatments produced significant increases 
for both the concepts. Isosceles Triangle and Trape- 
zoid. On the concept. Quadrilateral, 9 of 16 combina- 
tions resulted in significantly higher scores by the 
experimental groups. 

While the results of a short pilot study and an 
investigation of curricular materials had indicated 
that the four concepts chosen would be equally unlmown 
to most fifth graders, this obviously did not turn out 
to be true. Tlie scores on tho Rectangle item indicate 
that many fifth graders have a partial concept of rec- 
tangle and that the experimental treatments did little 
to improve this concept. 

Results of control groups on the Quadrilateral 
items also suggest some partial Imowledge of that con- 
cept. Another explanation is that youngsters v/ith an 
adequate concept of triangle could eliminate certain 
items on the test while guessing at the rest and, thereby 
attain a better than chance score. In fact, one who 
knew vjhat a triangle’''" was and eliminated tho three 



•'‘■An examination of a sample of 101 tests by control 
subjects showed that 79 correctly identified all three 
triangles as not-quadrilaterals. 
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triongloa as posaiblo quadrilaterals while guessing 
at the remaining twelve items would have an expected 
score of 9.00 which is quite close to the 9.06 median 
of control group moans. Examination of the t-ratios, 
in the light of this interpretation, loads to the con- 
clusion that there was less learning of the Quadrilat- 
eral concept than of the Isosceles Triangle or Trape- 
zoid ones but considerably more than of Rectangle. 

The Isosceles Triangle and Trapezoid concepts 
consistently resulted in significantly greater achieve- 
ment by the experimental group? with only Number 11, 
for which the results were influenced by prior instruc- 
tion, and Number 14, for which the t-ratio on the 
trapezoid concept was 2.01 as opposed to a critical value 
at the .05 level of 2.02, not producing significant dif- 
ferences. Control group moans for the Isosceles Tri- 
angle concept were slightly above chance values (median 
value was 8.26), probably for similar reasons to those 
discussed above. Although all except one of the fifteen 
items malting up the question on Isosceles Triangle were 
perceptually similar to, or actually wore, triangles, 
knowledge of the concept of triangle could result in 
eliminations which would alter the chaj^ce expectations. 

On the Trapezoid items, with almost no extraneous cues 
to aid- the subject (no triangles ws.ro used), the median 
of the control group means was 7.81, only .31 above 
chance expectation. 
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The roaponaos to the treatments varied among con- 
cepts and general statements about the efficacy of the 
various treatments across all concepts are difficult 
to formulate. One approach is to rank order the experi 
ments and examine those treatment combinations which 
uniformly have high ranks. Since there seems to have 
been partial knoxi^ledge of the Rectangle concept apart 
from that gained through the seveiaL treatments, only 
the other throe will be considered. Table 9 presents 
the treatment combinations, their rank ordering on each 
of the throe concepts (in terms of E-C) and the moan 
ranking of each treatment. Studying the moan raiikanga 
indicates that combinations 10, 1 , 6, Ij., 9, and 12 v/ere 
the most effective over all concepts. 

Tho result that Number 10 was most consistent in 
producing high levels of learning across all concepts 
is not unexpected as it represents the strongest com- 
bination presented. Nino instances vjere used instead 
of six, succeeding instances were presented against a 
background of all preceding instances, a verbal cue 
accomi^anied the slide presentation and the instances 
wore projected for 12 rather than 8 seconds. 

One might have expected combinations 11, 13, and II4. 
to yield the next best results, as they contained the 
one lovol of three of the four variables in the final 
design. Hovjover, those three combinations yielded 
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TABLE 9 



Treatment oombinations, rank orderings of the 
differences in means (E-C) for three concepts 
and the means of these three rankings. 
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consistently poor results, indicating an interaction 
botvrocn the vo,riables of iriodo of presentation, and amount 
of rodundmicy when combined with the added verbal cue 
and increased time per instance. The added redundancy 
with only the focus instance presented simultaneously 
and the presentation of a more limited set of instances 
against a background of all previous instances both led 
to poor results when combined with the 1 levels of ver- 
bal cue and time per slide. The response to these 
three combinations yielded results somewhat loxvor than 
those combinations which containod the high level of 
only one vario,ble. 

The basic set of eight experiments consisted of 
one at the low or 0 level of all seven variables and 
one each with a single variable set at the next higher 
or 1 level. Examination of the results of those exi^eri- 
ments, either in terms of the difference between means 
of the two groups (Tables 6 and 8} or in terms of the 
mean of the experimental group (Table 7 ) shows that the 
variables of verbal cue (Number 6) and time per slide 
(NuiTibor 7 ) produce the greatest effect. Compared to 
the base combination, Number 1, the addition of a one 

word verbal cue increased the response (E-C) from 4.83 

. ... 

to Increasing the time each instance was availa.ble 

from 8- seconds to 12 seconds resulted in an increased 
rospoDse of 7 •23* 



o 

ERIC 

hiaifflifftilTITiaiJ 



Ill 



The only other single variable that produced an 
effect substantially above that of the base combination 
was that of ratio of positive and negative instances 
(Number Ij.). The change from all positive instances to 
a ratio of two positive to one negative led to an in- 
creased response of 5»5l* 

In summary^ several points about the concept attain- 
ments aspects of the experimental sequence should be 
emphasized. Measurable, significant amounts of con- 
cept learning wore obtained consistently from extremely 
short exposures to the sets of instances of each con- 
cept. This leading was not measured immediately, but 
after a delay of twenty-four hours and the amount of 
learning compared favorably to that of a group which 
inadvertently had had three arithmetic lessons on these 
concepts approximately ten weeks earlier. Examination 
of performances across the several concepts indicates 
no relationship between theoretical complexity of the 
concepts and the obtained responses. Finally, of the 
eight variables included in the experiments, those of 
verbal ouo, time per slide, md ratio of positive to 
negative instances led to the largest increments in 
concept learning. 



CHAPTER V 



SUMMARY AND CONCLUSIONS 

This dissertation has had two iriajor foci, that of 
response surface methodology and that of concept learn- 
ing. The experimontal techniques and strategies of the 
RSM model, which have proved to be extremely powerful 
in a vjido variety of industrial applications, have never 
been applied to a research problem in the behavioral 
sciences. While reasons for this lack of application 
may be easily identified, basic similarities between 
industrial and educational production do exist and 
seemed to justify the attempt to apply some of the RSM 
ideas to educational problems. The psychological area 
of concept learning is one in which much empirica], work 
has been done-~important variables have been identified 
and gross relationships among variables established. 
Furthermore, the learning of concepts v;ould seem a major 
purpose of schools. Yet, applications of the findings 
of concept learning studies to experimental investigations 
of learning in schools aro relatively difficult to find. 

It has been suggested that this paucity of applications 
may bo due to difficulties with the experimental task 
used to simulate concept leai’ning behavior. 
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Sinco tho body of empirical ovidonpo in tho con- 
cept learning area has identified a number of variables 
vihich approximated the roQuiremonts of the RSH model> 
the major purpose of this dissertation has been the 
application of the exploration phase of RSM to a concept 
learning task in a school setting. The results of the 
investigation are best discussed separately under the 
headings, concept learning and response surface explora- 
tion. 

Concept Learning 

A sequence of sixteen experiments was run over a 
two-month period. While the sequence was not long 
enough to lead to an experimental combination producing 
a maximum resi)onse, there were some interesting by- 
products of individually examining tho results of the 
several experiments. These include; 

1) Fifteen of sixteen experimental groups did 
significantly better than the corresponding 
control group on the isosceles triangle end 
trapo 7 ‘Oid items of the response measure. It 
should be noted that these results were ob- 
tained from extremely short exposures (botvjoen 
I 4.8 and 108 seconds per concept) to the rele- 
vant sets of instances, o?ho sole exception was 
one group that had received prior instruction 
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on these concepts in the course of arithmetic 
lessons . 

2) Lack of significant differences botvjeen experi- 
mental and control groups in the schools which 
had taught a unit on geometry some months earlier 
v/as strictly a function of higher scores by the 
control groxap. On none of the four concepts was 
the experimental group’s moan response high and 
on the trapezoid concept it was less than the 
median of the other experimental groups. 

3) The single variables that elicited the greatest 
improvement vjhen presented at higher levels 
wore those of verbal cue and time per slide. 

The improvement due to an increase in the pro- 
portion of negative instances used v;as the only 
other result substantially higher than that of 
the base combination (the 0 levels of all vari- 
ables). 

i^) The complexity of the concept to bo learned ap- 
peared to have no systematic effect. The two 
medium complexity concepts, Trapezoid and Isos- 
celes Triangle, elicited the largest differ- 
ences between exporimental ajid control group 
moans while the high complexity concept, Rec- 
tangle, produced the smallest difference. The 
unifoi’mly high performance of control groux:>s 
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on the rectangle concept points to prior knov/1- 
edge of the concept indicating that it was prob- 
ably a poor choice. 

Prom the above points several generalizations can 
be drawn. 

It is possible to obtain measurable amounts of 
concept learning by extremely short exposure to a series 
of positive and negative instances of that concept and 
without resorting to a definition of the concept. The 
variability in the effects of unit changes in the seven 
variables in the basic design does not necessarily re- 
flect relative importance or strength of the variables 
as it could be a function of the units chosen by the 
experimenter. However, the addition of a single word, 
attention -directing verbal cue, and the increase of four 
seconds in the time the subjects had to study each in- 
stance did produce responses substantially above the 
base levels of each, while an increase in the number of 
instances from 6 to 9 and presenting each new instance 
along with all previous ones did not. 

That verbal cue end the ratio of positive to nega- 
tive instences were tv;o of the throe ’variables produc- 
ing substantial increments above the base level may be 
a function of the indoterminateness of the universe 
within’ which the instances were being presented. That 
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is, because the universe was ill-defined, those changes 
in the treatments which directed attention to relevant 
dimensions or narrowed the frame of reference resulted 
in the largest increments in learning. 

An examination of the geometry materials in fifth 
grade ai-*ithmetic texts will show that concepts such 
as those used here are presented almost entirely with 
X^ositive instan.ces, often only one, and a written defini- 
tion. The fortuitous, but accidental, inclusion of a 
school in which the concepts had been taught with this 
general approach provides some insight into the effect- 
iveness of that instruction. One might expect that the 
experimental treatment v/ould have operated to strongly 
reinforce the earlier learning with resultant test scores 
considerably higher than other experimental groups. 

This did not happen, suggesting that classroom instruc- 
tion might benefit from more emphasis upon the unique 
aspects of the ai>proaoh used here, specifically the 
inclusion of more negative instances of the concept, a 
wider variety of instances, and allowing the student to 
infer the concept witnout a definition. 

Finally, the idea of the complexity of a concept, 
which is well defined in a finite universe, may not be 
the same in an infinite universe. There was no consistent 
order across the comi:>lexiuy dimensions but this may have 



been a result of the stated difficulties with the rec- 
tangle concept. 

Response Surface Exploration 

The attempt to explore the response surface for 
an area of maximum response was neither a success nor 
a failure. Time ran out before a definite evaluation 
of the technique could be made. However, the empirical 
results tend to substantiate the theoretical rationale 
on several points: 

1) The design did move in the direction of increased 
response vjith a sequence of six experiments be*™ 
yond the basic simplex resulting in a mean in- 
crease in the response of approximately one 
standard deviation. 

2) The possibilities of including additional vari- 
ables or rejecting ones that were having little 
effect was demonstrated by reducing the design 
from seven to four variables after the tenth 
experiment. 

3) The ability of the experimental strategy to 
lead to more optimal results in the presence 
of experimental error was displayed when the 
strategy led to a rejection of the results of 
experiment 11 (where treatment effects were 

» 

confounded with effects of prior instruction) 
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and a rerunning of that experimental combina- 
tion . 

i^,) The advantage of the simplex design over a fac- 
torial design in terms of reduced experimental 
effort was not shown. The advantage of the 
simplex procedure should not become apparent 
until one moves to the third or fourth level 
of some of the factors and the restricted length 
of the sequence did not permit this. 

In retrospect, too much v/as attem.pted in this 
study. The attempt to utilize some of the ideas of re- 
sponse surface methodology in maximizing the response 
meant that the full potential for evaluating ideas from 
the concept learning literature in this setting was not 
realized. Similarly, the desire to use a task and re- 
sponse measure which would avoid the criticisms dir- 
ected at other recent concept learning studies introduced 
delays v;hich prevented a thorough test of the exploration 
ideas taken from the response surface literature. The 
simple expedient of using a measure of irrmiediato learn- 
ing rather than waiting twenty-four hours to test would 
have permitted the running of twice as many experiments 
in the same time period. 

On. the other hand, it should be noted that in neither 
area has this study been a failure; it has merely been 



119 



indecisive. More effort on tho part of educational 
psychologists must be directed tov/ards the study of con- 
cept learning with tasks which are more accurate analogs 
of what teachers think of as concepts. Furthermore, 
the identification of combinations of variables which, 
within specified restrictions, produce maximum learning 
would seem a necessary pre-requisite to tho scientific 
planning of learning experiences. Educational researchers 
have not typically focussed on the problem of optimiza- 
tion of desired responses. However, it would seem that 
they should and it is suggested that a response surface 
conceptualization of many problems v/ould bo of value. 
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APPENDIX A 



Figures used as exemplars and non -exemplars of the 
concepts quadrilateral, isosceles triangle, trapezoid 
and rectangle. 



Quadrilateral 

Exemplars (positive instances) 





Non-exemplars (negative instances ) 



Isosceles Triangle 




Trapezoid 



Exemplars 




Non - exemp 1 ar s 
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APPENDIX B 



The following is the transcript of the instruc- 
tions and warm-up tasks presented to all experimental 
groups. After an informal introduction by the experi- 
menter, the following was presented via a tape record- 
ing. 



You are talcing part in an experiment to find out 
how well boys and girls your age can learn to identify 
certain kinds of geometric figures. We are also trying 
to determine what can bo done to make it easier to learn 
these figures. 



You may know something about some of the figures 
already. Even if you do we still want you to pay care- 
ful attention since some figures may have more than one 
name and wo would like you to know all the names. 

For example, this is a house. It is also a build- 
ing. It is also a shelter. 

(a line sketch of a house was projected.) 

This, also, is a building and a shelter but you 
would not call it a house. 



(The house was 
This tent is a 



replaced by a sketch of a tent.) 
shelter but it is not a building 



nor is it a house. 



Nov/ you are going to so© 
slides each showing a figure. 



some short series of 
You v/ill be told whether 



12 ? 
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the figure ia or is not an example of the kind of figure 
you are trying to learn* Your job will bo to try to 
decide just what it is that gives a figure its name. 

The first two series are practice ones about fig- 
ures which all of you know. They will give you some 
idea of what to expect from the more difficult sets that 
follow. 

(A series of five triangles vjas projected on the 
screen for eight seconds each with four seconds between 
each slide. Accompanying each slide was the verbal 
direction . . . ) 

Thia is a triangle. 

(A pause of 15 seconds follovjed the last slide. ) 

That was aji easy series. You should have been 
able to decide that a triangle is a figure or shape vjith 
three straight sides. 

The next set of slides will be somevdiat different, 
pay careful attention to the first slide since it will 
bo the only one to show you what the figure is. All 
the other sides will show you figures that do not fit 
the name. 

(The subjects v;ero then shovm a sequence of five 
slides, tho first of which was a square while the others 
were not. Each was accompanied by tho verbal direction . . .) 

This is (is not) a square. 



e 

ERIC 
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Are you still certain you know what a square is? 
Could you put it in words? 

(Pause for 15 seconds) 

A square is a four-sided figure. All four sides 

r 

are the same length. It also has four corners or angles 
all of which are alike. 

We call this kind of corner or angle a "square cor- 
ner" or "right angle." 

(A slide v;ith two linos intersecting in a right 

* ^ 

angle was projected. ) 

The remaining sets of slides will bo more difficult 
since you probably will not knovj the words as you did 
with triangle and square. Remember that some figures 
may have more than one naine. As I said before "all 
houses are buildings" but "not all buildings are houses." 
The same thing applies to some of the figures you will 
see. 

Try to learn as much as you can from each set of 
slides. After each set you vjill not be told vjhat the 
correct answer is, but you will be asked a question to 
see if you have understood what kind of figure fits the 

V 

name . 

(Pause) 

In this first series of slides you will bo trying 




to find out V7hat a quadrilateral is. 
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(This was followed by the particular seq^iences of 
slides comprising the experimental treatments for the 
concept, quadrilateral.) 

You should now have a good idea what it is that 
makes a quadrilateral. Gould you draw one? Using your 
ruler try to draw a quadrilateral in the space after 
number 1 on the papor you wore given. 

(Pause 25 seconds.) 

Now try to draw another one right next to it, but 
make this quadrilateral as different as you can from 
the first one and still have it be a quadrilateral. 

(Pause 25 seconds.) 

Now let us look at another set of slides. This 
time you will bo trying to decide what a trapezoid is. 

(This was followed by the sequence of slides on 
the concept, trapezoid. ) 

Could you new say what it is that maJees a shape or 
figure a trapezoid? Try writing out in vjords what you 
ichink a trapezoid is. Write your definition after nmn- 
bor 2. 

(Pause 45 seconds.) 

Let us try another set of slides. You will be 
trying to learn what an isosceles triangle is. 

(This was followed by the sequence of slides on th 
concept, isosceles triangle.) 

VJhat makes an isosceles triangle a special kind of 
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triangle? Were you able to figure it out? Using your 
ruler draw an isosceles triangle after number 3 on 
your paper. 

(pause 30 seconds.) 

Let’s see if you can draw another one. Make this 
isosceles triangle as different as you can from the 
first one and still have it be an isosceles triangle. 

(Pause 30 seconds.) 

We have one more series of slides to show you to- 
day. This word may be more familiar than the last three 
but it will not be any easier to learn. You will bo 
trying to decide what a rectangle is. 

(This was followed by the sequence of slides on 
the concept, rectangle. ) 

Now you should know what a rectangle is. Write 
your definition of a rectangle after number I4.. 

(Pause 45 seconds.) 

Let’s see if you can spell all the words you learned 
today. Turn your papers over. After number 5 write the 

word, trapezoid trapezoid. That is a hard one but try 

your best. 

(Pause. ) 

After number 6 write the word quadrilateral---quad- 
ri lateral. 

(Pause. ) 

After number 7 write the word rectangle rectangle. 
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(Pause. ) 

Now the last one is a little harder*.. Try writing 

the words isosceles triangle after number 8 isosceles 

tri angle. 

(At this point the tape recorder was turned off, the 
students were cautioned not to discuss the treatment 
until after the experimenter *s return the next day, and 
sent bade to their classrooms.) 



APPENDIX C 



GEOMETRIC FIGURE IDENTIFICATION TEST 

1. Circle each figure below which is a QUADRILATERAL, 
Do nothing to the other figures. 




2, Circle each figure below v;hich is an ISOSCELES TRI 
ANGLE, Do nothing to the other figures. 
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3. Circle each figure below v;hich is aTRAPEZOID, Do 
■ nothing to the other figures. 




4* Circle each figure below which is a RECTANGLE. Do 
nothing to the other figures. 




APPENDIX D 



LETTER SOLICITING PARTICIPATION BY SCHOOL DISTRICTS 



THE UNIVERSITY OP WISCONSIN 
2218 University Avenue 
Madison, VJisoonsin 53705 

Research, and Development Center 

For Learning and Re-Education Maa’ch 17> 1965 



Dean Lindley Stiles, Professor Herbert J, Klaus- 
meier, and Mr. Chester W. Spangler of the Research and 
Development Center for Learning and Re-Education of the 
University of Wisconsin invite your cooperation in a 
research project designed to increase understanding of 
factors that affect children’s learning of concepts. 

We propose to learn how various factors connected v/ith 
presenting information to fifth- and sixth-grade children 
affect hovj well they learn and remember concepts in 
geometry through a single short audio-visual (slides and 
tape recording) presentation of plane geometry materials. 
We hope that this information V7ill be useful in improv- 
ing learning mathematics in the elementary school. Your 
cooperation would include the follovjing; 

1) Pill out the short information form which you 
find enclosed and return to the project director. 

2) Furnish the project director with a list of 
your fifth -grade students prior to the day of the ex- 
periment. This will allow him to randomly assign stu- 
dents to experimental and control groups. 

3) Allow the student’s experimental team to work 
with the selected experimental group (approximately 20) 
for one hour on a prescheduled day. A room which is 
adequate both for the viewing of slides and writing will 
bo needed for the experiment. The remainder of the fifth 
graders vjould be left v/ith your teachers in another room. 

4) Allow the experimental team to return the fol- 
lowing day to administer to all fifth graders a short 
test covering the materials presented. 



135 



136 



. This is all your cooperation will require. We 
have attempted to make the intrusion into the normal 
school program as short and unobtrusive as possible. 

We should like to emphasize that v;e have absolutely 
no interest in making comparisons either between teachers 
or among schools. A summary report of the project will 
be sent to all schools when it becomes available. 

Robert C. Remstad, a project assistant on the Cen- 
ter staff, will be in charge of the experiment. H© 
will contact you within the next ten days to ansvjer any 
questions you may have and to enlist your participation. 
\Je trust you will allow us to use some of your students 
for this purpose. 



Yours sincerely. 



Herbert J. Klausmeier, co-director 
Research and Development Center 
for Learning and Re-Education 



APPENDIX E 



The following VJisoonsin eleraentary schools par- 
ticipated in this study by allowing the experimenter to 
use their fifth grade students as subjects for the se- 
quence of experiments. 



Black Earth- -Maz Oman ie 

Black Earth Eleraentary 
Mazomanie Eleraentary 

Bl anchardvi lie 

Blanchardville Elementary 

Boscobol 

Bosoobel Elementary 
De Forest 

De Forest Elementary 
Dodgoville 

Dodgeville Elementary 

Elroy — Kendall — Wilton 
Elroy Elementary 

Fennimore 

Fennimore Elementary 

Hillsboro 

City Elementary 

McFarland 

McFarland Elementary 
Mauston 

Lacrosse Street Elementary 



New Lisbon 

Now Lisbon Elementary 
Oakfield 

Oakfield Eleraentary 
Stoughton 

Kegonsa Elementary 

Sun Prairie 

Norths ide Elementary 
Souths ide Elementary 

Verona 

Verona Elementary 



Mount Horeb 

Mount Horeb Elementary 



New Crlarus 

Now Glarus Elementary 



